ABSTRACT. The purpose of this paper is to provide a complete probabilistic analysis of a large class 
of stochastic differential games for which the interaction between the players is of mean-field type. 
We implement the Mean-Field Games strategy developed analytically by Lasry and Lions in a purely 
probabilistic framework, relying on tailor-made forms of the stochastic maximum principle. While 
we assume that the state dynamics are affine in the states and the controls, our assumptions on the 
nature of the costs are rather weak, and surprisingly, the dependence of all the coefficients upon the 
statistical distribution of the states remains of a rather general nature. Our probabilistic approach calls 
for the solution of systems of forward-backward stochastic differential equations of a McKean-Vlasov 
type for which no existence result is known, and for which we prove existence and regularity of the 
corresponding value function. Finally, we prove that solutions of the mean-field game as formulated by 
Lasry and Lions do indeed provide approximate Nash equilibriums for games with a large number of 
players, and we quantify the nature of the approximation. 



1. Introduction 

In a trailblazing contribution, Lasry and Lions [19, 20, 21] proposed a methodology to produce 
approximate Nash equilibriums for stochastic differential games with symmetric interactions and 
a large number of players. In their model, the costs to a given player feel the presence and the 
behavior of the other players through the empirical distribution of their private states. This type of 
interaction was introduced and studied in statistical physics under the name of mean-field interaction, 
allowing for the derivation of effective equations in the limit of asymptotically large systems. Using 
intuition and mathematical results from propagation of chaos, Lasry and Lions propose to assign to 
each player, independently of what other players may do, a distributed closed loop strategy given by 
the solution of the limiting problem, arguing that such a resulting game should be in an approximate 
Nash equilibrium. This streamlined approach is very attractive as large stochastic differential games 
are notoriously intractable. They formulated the limiting problem as a system of two highly coupled 
nonlinear partial differential equations (PDE for short): the first one, of the Hamilton-Jacobi-Bellman 
type, takes care of the optimization part, while the second one, of Kolmogorov type, guarantees the 
time consistency of the statistical distributions of the private states of the individual players. The 
issue of existence and uniqueness of solutions for such a system is a very delicate problem, as the 
solution of the former equation should propagate backward in time from a terminal condition while 
the solution of the latter should evolve forward in time from an initial condition. More than the 
nonlinearities, the conflicting directions of time compound the difficulties. 

In a subsequent series of works (H HU [TUl \Tj\ [TBI with PhD students and postdoctoral fellows, 
Lasry and Lions considered applications to domains as diverse as the management of exhaustible 
resources like oil, house insulation, and the analysis of pedestrian crowds. Motivated by problems in 
large communication networks, Caines, Huang and Malhamé introduced, essentially at the same time 
[14], a similar strategy which they call the Nash Certainty Equivalence. They also studied practical 
applications to the behavior of large populations [13]. 

The goal of the present paper is to study the effective Mean-Field Game equations proposed by 
Lasry and Lions, from a probabilistic point of view. To this end, we recast the challenge as a fixed 
point problem in a space of flows of probability measures, show that these fixed points do exist and 
provide approximate Nash equilibriums for large games, and quantify the accuracy of the approximation. 

We tackle the limiting stochastic optimization problems using the probabilistic approach of the 
stochastic maximum principle, thus reducing the problems to the solutions of Forward Backward 
Stochastic Differential Equations (FBSDEs for short). The search for a fixed flow of probability 
measures turns the system of forward-backward stochastic differential equations into equations of 
the McKean-Vlasov type where the distribution of the solution appears in the coefficients. In this 
way, both the optimization and interaction components of the problem are captured by a single FBSDE, 
avoiding the twofold reference to Hamilton-Jacobi-Bellman equations on the one hand, and 
Kolmogorov equations on the other hand. As a by-product of this approach, the stochastic dynamics 
of the states could be degenerate. We give a general overview of this strategy in Section 2 below. 
Motivated in part by the works of Lasry, Lions and collaborators, Backward Stochastic Differential 
Equations (BSDEs) of the mean field type have recently been studied. See for example EHl. However, 
existence and uniqueness results for BSDEs are much easier to come by than for FBSDEs, and 
here, we have to develop existence results from scratch. 

Our first existence result is proven for bounded coefficients by means of a fixed point argument 
based on Schauder's theorem pretty much in the same spirit as Cardaliaguet's notes Q. Unfortunately, 
such a result does not apply to some of the linear-quadratic (LQ) games already studied 
lfI31 [U 12 El , and some of the most technical proofs of the paper are devoted to the extension of this 
existence result to coefficients with linear growth. See Section 3. Our approximation and convergence 
arguments are based on probabilistic a priori estimates obtained from tailor-made versions of 
the stochastic maximum principle which we derive in Section 2. The reader is referred to the book 
of Ma and Yong [22] for background material on adjoint equations, FBSDEs and the stochastic maximum 
principle approach to stochastic optimization problems. As we rely on this approach, we find 
it natural to derive the compactness properties needed in our proofs from convexity properties of the 
coefficients of the game. The reader is also referred to the papers by Hu and Peng [12] and Peng 
and Wu [23] for general solvability properties of standard FBSDEs within the same framework of 
stochastic optimization. 

The thrust of our analysis is not limited to existence of a solution to a rather general class of 
McKean-Vlasov FBSDEs, but also to the extension to this non-Markovian set-up of the construction 
of the FBSDE value function expressing the solution of the backward equation in terms of the solution 
of the forward dynamics. The existence of this value function is crucial for the formulation and the 
proofs of the results of the last part of the paper. In Section 4, we indeed prove that the solutions 
of the fixed point FBSDE (which include a function $\hat\alpha$ minimizing the Hamiltonian of the system, 
three stochastic processes $(X_t, Y_t, Z_t)_{0 \le t \le T}$ solving the FBSDE, and the FBSDE value function $u$) 
provide a set of distributed strategies which, when used by the players of an N-player game, form an 
$\epsilon_N$-approximate Nash equilibrium, and we quantify the speed at which $\epsilon_N$ tends to 0 when $N \to +\infty$. 
This type of argument has been used for simpler models in or (5|. Here, we use convergence 
estimates which are part of the standard theory of propagation of chaos (see for example [26], [16]) and 
the Lipschitz continuity and linear growth of the FBSDE value function $u$ which we prove earlier in the 
paper. 

2. General Notation and Assumptions 

Here, we introduce the notation and the basic tools from stochastic analysis which we use throughout 
the paper. 

2.1. The N Player Game. We consider a stochastic differential game with N players, each player 
$i \in \{1, \dots, N\}$ controlling his own private state $U_t^i \in \mathbb{R}^d$ at time $t \in [0, T]$ by taking an action $\beta_t^i$ in 
a set $A \subset \mathbb{R}^k$. We assume that the dynamics of the private states of the individual players are given 
by Itô's stochastic differential equations of the form 

(1)   $dU_t^i = b^i(t, U_t^i, \bar\nu_t^N, \beta_t^i)\,dt + \sigma^i(t, U_t^i, \bar\nu_t^N, \beta_t^i)\,dW_t^i, \quad 0 \le t \le T, \quad i = 1, \dots, N,$ 

where the $W^i = (W_t^i)_{0 \le t \le T}$ are m-dimensional independent Wiener processes, 
$(b^i, \sigma^i) : [0, T] \times \mathbb{R}^d \times \mathcal{P}(\mathbb{R}^d) \times A \to \mathbb{R}^d \times \mathbb{R}^{d \times m}$ are deterministic measurable functions satisfying a set of assumptions 
spelled out below, and $\bar\nu_t^N$ denotes the empirical distribution of $U_t = (U_t^1, \dots, U_t^N)$ defined as 

$\bar\nu_t^N(dx') = \frac{1}{N} \sum_{i=1}^N \delta_{U_t^i}(dx').$ 

Here and in the following, we use the notation $\delta_x$ for the Dirac measure (unit point mass) at $x$, and 
$\mathcal{P}(E)$ for the space of probability measures on $E$ whenever $E$ is a topological space equipped with 
its Borel $\sigma$-field. In this framework, $\mathcal{P}(E)$ itself is endowed with the Borel $\sigma$-field generated by the 
topology of weak convergence of measures. 
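
The role of the empirical distribution in (1) can be illustrated with a minimal simulation sketch. Everything specific below is an assumption made for illustration and is not taken from the paper: the dimension is d = m = 1, the control is set to zero, the volatility is a constant `sigma`, and the drift is the hypothetical choice b(t, x, nu, beta) = (mean of nu) - x + beta, so that players interact only through the mean of the empirical measure. The function name `simulate_players` is likewise hypothetical.

```python
import numpy as np


def simulate_players(n_players, n_steps, horizon, sigma, x0, seed=0):
    """Euler-Maruyama sketch of N interacting private states (d = m = 1).

    Illustrative assumption (not from the paper):
        b(t, x, nu, beta) = mean(nu) - x + beta, with beta = 0,
    and a constant, uncontrolled volatility sigma.
    Returns an array of shape (n_steps + 1, n_players).
    """
    rng = np.random.default_rng(seed)
    dt = horizon / n_steps
    states = np.full(n_players, float(x0))
    path = np.empty((n_steps + 1, n_players))
    path[0] = states
    for step in range(n_steps):
        # Integral of x against the empirical measure (1/N) sum_i delta_{U_t^i}.
        empirical_mean = states.mean()
        # Each player feels the others only through the empirical measure.
        drift = empirical_mean - states
        # Independent Wiener increments, one per player.
        noise = rng.normal(0.0, np.sqrt(dt), size=n_players)
        states = states + drift * dt + sigma * noise
        path[step + 1] = states
    return path
```

With `sigma = 0` and identical initial conditions, every player sits at the empirical mean, the drift vanishes, and the states stay put, which gives a quick sanity check of the interaction term.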

Each player chooses a strategy in the space $\mathbb{A} = \mathbb{H}^{2,k}$ of progressively measurable $A$-valued 
stochastic processes $\beta = (\beta_t)_{0 \le t \le T}$ satisfying the admissibility condition: 

(2)   $\mathbb{E}\left[ \int_0^T |\beta_t|^2\,dt \right] < +\infty.$ 



The choice of a strategy is driven by the desire to minimize an expected cost over the period $[0, T]$, 
each individual cost being a combination of running and terminal costs. For each $i \in \{1, \dots, N\}$, 
the running cost to player $i$ is given by a measurable function $f^i : [0, T] \times \mathbb{R}^d \times \mathcal{P}(\mathbb{R}^d) \times A \to \mathbb{R}$ 
and the terminal cost by a measurable function $g^i : \mathbb{R}^d \times \mathcal{P}(\mathbb{R}^d) \to \mathbb{R}$ in such a way that if the N 
players use the strategy $\beta = (\beta^1, \dots, \beta^N) \in \mathbb{A}^N$, the expected total cost to player $i$ is 

(3)   $J^i(\beta) = \mathbb{E}\left[ g^i(U_T^i, \bar\nu_T^N) + \int_0^T f^i(t, U_t^i, \bar\nu_t^N, \beta_t^i)\,dt \right].$ 

Here $\mathbb{A}^N$ denotes the product of N copies of $\mathbb{A}$. Later in the paper, we let $N \to \infty$ and use the notation 
$J^{N,i}$ in order to emphasize the dependence upon N. Notice that even though only $\beta^i$ appears in the 
formula giving the cost to player $i$, this cost depends upon the strategies used by the other players 
indirectly, as these strategies affect not only the private state $U_t^i$, but also the empirical distribution 
of all the private states. As explained in the introduction, our model requires that the behaviors 
of the players be statistically identical, imposing that the coefficients $b^i$, $\sigma^i$, $f^i$ and $g^i$ do not depend 
upon $i$. We denote them by $b$, $\sigma$, $f$ and $g$. 

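In solving the game, we are interested in the notion of optimality given by the concept of Nash 
equilibrium. Recall that a set of admissible strategies $\alpha^* = (\alpha^{*1}, \dots, \alpha^{*N}) \in \mathbb{A}^N$ is said to be a 
Nash equilibrium for the game if 

$\forall i \in \{1, \dots, N\}, \ \forall \alpha^i \in \mathbb{A}, \quad J^i(\alpha^*) \le J^i(\alpha^{*-i}, \alpha^i),$ 

where we use the standard notation $(\alpha^{*-i}, \alpha^i)$ for the set of strategies $(\alpha^{*1}, \dots, \alpha^{*N})$ where $\alpha^{*i}$ has 
been replaced by $\alpha^i$. 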

2.2. The Mean-Field Problem. In the case of large symmetric games, some form of averaging is 
expected when the number of players tends to infinity. The Mean-Field Game (MFG) philosophy 
of Lasry and Lions is to search for approximate Nash equilibriums through the solution of effective 
equations appearing in the limiting regime $N \to \infty$, and assigning to each player the strategy $\alpha$ 
provided by the solution of the effective system of equations they derive. In the present context, the 
implementation of this idea involves the solution of the following fixed point problem which we break 
down in three steps for pedagogical reasons: 

(i) Fix a deterministic function $[0, T] \ni t \mapsto \mu_t \in \mathcal{P}(\mathbb{R}^d)$; 

(ii) Solve the standard stochastic control problem 

(4)   $\inf_{\alpha \in \mathbb{A}} \mathbb{E}\left[ \int_0^T f(t, X_t, \mu_t, \alpha_t)\,dt + g(X_T, \mu_T) \right]$ 

subject to $dX_t = b(t, X_t, \mu_t, \alpha_t)\,dt + \sigma(t, X_t, \mu_t, \alpha_t)\,dW_t$; $X_0 = x_0$. 

(iii) Determine the function $[0, T] \ni t \mapsto \hat\mu_t \in \mathcal{P}(\mathbb{R}^d)$ so that $\forall t \in [0, T]$, $\mathbb{P}_{X_t} = \hat\mu_t$. 

Once these three steps have been taken successfully, if the fixed-point optimal control $\alpha$ identified in 
step (ii) is in feedback form, i.e. of the form $\alpha_t = \hat\alpha(t, X_t, \mathbb{P}_{X_t})$ for some function $\hat\alpha$ on 
$[0, T] \times \mathbb{R}^d \times \mathcal{P}(\mathbb{R}^d)$, denoting by $\hat\mu_t = \mathbb{P}_{X_t}$ the fixed-point marginal distributions, the prescription 
$\hat\alpha_t^{i*} = \hat\alpha(t, X_t^i, \hat\mu_t)$, if used by the players $i = 1, \dots, N$ of a large game, should form an approximate Nash 
equilibrium. We prove this fact rigorously in Section 4 below, and we quantify the accuracy of the 
approximation. 
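
The three steps (i)-(iii) can be sketched numerically on a toy model. The model below is an assumption chosen for tractability and is not the general setting of the paper: d = 1, $dX_t = \alpha_t\,dt + \sigma\,dW_t$, $f(t, x, \mu, \alpha) = \alpha^2/2 + (x - \bar\mu)^2/2$ with $\bar\mu$ the mean of $\mu$, and $g = 0$. Since the costs see $\mu_t$ only through its mean, matching the means is enough here. For a frozen flow of means $m_t$, the expected adjoint system is decoupled by the affine ansatz $\bar y_t = \eta_t \bar x_t + \chi_t$, which leads to $\eta' = \eta^2 - 1$, $\chi' = \eta\chi + m$, $\eta_T = \chi_T = 0$ and $\bar x' = -(\eta \bar x + \chi)$. The function names are hypothetical, and the plain Picard iteration used for step (iii) should only be expected to converge on short horizons.

```python
import numpy as np


def solve_frozen_flow(mean_flow, x0, horizon):
    """Step (ii) for a frozen flow of means (toy LQ model, an assumption).

    Solves backward (explicit Euler) for eta and chi, then forward for the
    mean of the optimally controlled state. Returns that mean on the grid.
    """
    n_steps = len(mean_flow) - 1
    dt = horizon / n_steps
    eta = np.zeros(n_steps + 1)  # terminal condition eta_T = 0
    chi = np.zeros(n_steps + 1)  # terminal condition chi_T = 0
    for k in range(n_steps - 1, -1, -1):
        eta[k] = eta[k + 1] - dt * (eta[k + 1] ** 2 - 1.0)
        chi[k] = chi[k + 1] - dt * (eta[k + 1] * chi[k + 1] + mean_flow[k + 1])
    mean_state = np.empty(n_steps + 1)
    mean_state[0] = x0
    for k in range(n_steps):
        # Mean of the optimal control is -(eta * mean_state + chi).
        mean_state[k + 1] = mean_state[k] - dt * (eta[k] * mean_state[k] + chi[k])
    return mean_state


def picard_fixed_point(x0, horizon, n_steps=200, n_iter=60):
    """Steps (i)-(iii): iterate the map 'frozen mean flow -> optimal mean flow'."""
    mean_flow = np.zeros(n_steps + 1)  # step (i): arbitrary initial guess
    for _ in range(n_iter):
        # Step (ii) solves the frozen problem; step (iii) feeds the result back.
        mean_flow = solve_frozen_flow(mean_flow, x0, horizon)
    return mean_flow
```

In this toy model the constant flow $m_t = x_0$ is a fixed point: when the population mean already sits at the initial condition, the optimal mean control vanishes. Calling `picard_fixed_point(1.0, 1.0)` is a quick way to observe the iteration settling on that flow.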

2.3. The Hamiltonian. For the sake of simplicity, we assume that $A = \mathbb{R}^k$, and in order to lighten 
the notation and to avoid many technicalities, that the volatility is an uncontrolled constant matrix 
$\sigma \in \mathbb{R}^{d \times m}$. The fact that the volatility is uncontrolled allows us to use the simplified version for the 
Hamiltonian: 

(5)   $H(t, x, \mu, y, \alpha) = \langle b(t, x, \mu, \alpha), y \rangle + f(t, x, \mu, \alpha),$ 

for $t \in [0, T]$, $x, y \in \mathbb{R}^d$, $\alpha \in \mathbb{R}^k$, and $\mu \in \mathcal{P}(\mathbb{R}^d)$. Our first task will be to minimize the Hamiltonian 
with respect to the control parameter, and understand how minimizers depend upon the other 
variables. We shall use the following standing assumptions. 

(A.1) The drift $b$ is an affine function of $\alpha$ in the sense that it is of the form 

(6)   $b(t, x, \mu, \alpha) = b_1(t, x, \mu) + b_2(t)\alpha,$ 

where the mapping $[0, T] \ni t \mapsto b_2(t) \in \mathbb{R}^{d \times k}$ is measurable and bounded, and the mapping 
$[0, T] \times \mathbb{R}^d \times \mathcal{P}_2(\mathbb{R}^d) \ni (t, x, \mu) \mapsto b_1(t, x, \mu) \in \mathbb{R}^d$ is measurable and bounded on bounded subsets of $[0, T] \times \mathbb{R}^d \times \mathcal{P}_2(\mathbb{R}^d)$. 

Here and in the following, whenever $E$ is a separable Banach space and $p$ is an integer greater 
than 1, $\mathcal{P}_p(E)$ stands for the subspace of $\mathcal{P}(E)$ of probability measures of order $p$, i.e. having a finite 
moment of order $p$ so that $\mu \in \mathcal{P}_p(E)$ if $\mu \in \mathcal{P}(E)$ and 

(7)   $M_{p,E}(\mu) = \left( \int_E \|x\|_E^p \, d\mu(x) \right)^{1/p} < +\infty.$ 

We write $M_p$ for $M_{p,\mathbb{R}^d}$. Below, bounded subsets of $\mathcal{P}_p(E)$ are defined as sets of probability measures 
with uniformly bounded moments of order $p$. 

(A.2) There exist two positive constants $\lambda$ and $c_L$ such that for any $t \in [0, T]$ and $\mu \in \mathcal{P}_2(\mathbb{R}^d)$, the 
function $\mathbb{R}^d \times \mathbb{R}^k \ni (x, \alpha) \mapsto f(t, x, \mu, \alpha) \in \mathbb{R}$ is once continuously differentiable with Lipschitz-continuous 
derivatives (so that $f(t, \cdot, \mu, \cdot)$ is $C^{1,1}$), the Lipschitz constant in $x$ and $\alpha$ being bounded 
by $c_L$ (so that it is uniform in $t$ and $\mu$). Moreover, it satisfies the convexity assumption 

(8)   $f(t, x', \mu, \alpha') - f(t, x, \mu, \alpha) - \langle (x' - x, \alpha' - \alpha), \partial_{(x,\alpha)} f(t, x, \mu, \alpha) \rangle \ge \lambda |\alpha' - \alpha|^2.$ 

The notation $\partial_{(x,\alpha)} f$ stands for the gradient in the joint variables $(x, \alpha)$. Finally, $f$, $\partial_x f$ and $\partial_\alpha f$ are 
locally bounded over $[0, T] \times \mathbb{R}^d \times \mathcal{P}_2(\mathbb{R}^d) \times \mathbb{R}^k$. 

The minimization of the Hamiltonian is taken care of by the following result. 

Lemma 1. If we assume that assumptions (A.1-2) are in force, then, for all $(t, x, \mu, y) \in [0, T] \times \mathbb{R}^d \times 
\mathcal{P}_2(\mathbb{R}^d) \times \mathbb{R}^d$, there exists a unique minimizer $\hat\alpha(t, x, \mu, y)$ of $H$. Moreover, the function $[0, T] \times \mathbb{R}^d \times 
\mathcal{P}_2(\mathbb{R}^d) \times \mathbb{R}^d \ni (t, x, \mu, y) \mapsto \hat\alpha(t, x, \mu, y)$ is measurable, locally bounded and Lipschitz-continuous 
with respect to $(x, y)$, uniformly in $(t, \mu) \in [0, T] \times \mathcal{P}_2(\mathbb{R}^d)$, the Lipschitz constant depending only 
upon $\lambda$, the supremum norm of $b_2$ and the Lipschitz constant of $\partial_\alpha f$ in $x$. 

Proof. For any given $(t, x, \mu, y)$, the function $\mathbb{R}^k \ni \alpha \mapsto H(t, x, \mu, y, \alpha)$ is once continuously 
differentiable and strictly convex so that $\hat\alpha(t, x, \mu, y)$ appears as the unique solution of the equation 
$\partial_\alpha H(t, x, \mu, y, \hat\alpha(t, x, \mu, y)) = 0$. By strict convexity, measurability of the minimizer $\hat\alpha(t, x, \mu, y)$ 
is a consequence of the gradient descent algorithm, since the minimizer is the pointwise limit of the 
iterates of the algorithm, each of which is measurable. Local boundedness of $\hat\alpha(t, x, \mu, y)$ also follows 
from strict convexity since by (8), 

$H(t, x, \mu, y, 0) \ge H(t, x, \mu, y, \hat\alpha(t, x, \mu, y))$ 
$\qquad \ge H(t, x, \mu, y, 0) + \langle \hat\alpha(t, x, \mu, y), \partial_\alpha H(t, x, \mu, y, 0) \rangle + \lambda |\hat\alpha(t, x, \mu, y)|^2,$ 

so that 

(9)   $|\hat\alpha(t, x, \mu, y)| \le \lambda^{-1} \left( |\partial_\alpha f(t, x, \mu, 0)| + |b_2(t)| \, |y| \right).$ 

Inequality (9) will be used repeatedly. Moreover, by the implicit function theorem, $\hat\alpha$ is Lipschitz-continuous 
with respect to $(x, y)$, the Lipschitz constant being controlled by the uniform bound on $b_2$ 
and by the Lipschitz constant of $\partial_{(x,\alpha)} f$. □ 
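
Lemma 1 and the bound (9) can be checked on a small example. The sketch below assumes the hypothetical running cost $f(t, x, \mu, \alpha) = \lambda|\alpha|^2$ (which satisfies (8) with equality in the $\alpha$ variable) and a constant matrix `b2`, so that the $\alpha$-dependent part of the Hamiltonian is $\langle b_2\alpha, y\rangle + \lambda|\alpha|^2$ and the first order condition gives $\hat\alpha = -b_2^\top y / (2\lambda)$. The gradient descent loop mirrors the argument used in the proof; the function names are illustrative only.

```python
import numpy as np


def hamiltonian(alpha, y, b2, lam):
    """alpha-dependent part of H for the assumed cost f = lam * |alpha|^2."""
    return float(y @ (b2 @ alpha) + lam * (alpha @ alpha))


def minimizer_closed_form(y, b2, lam):
    """Solve the first order condition b2^T y + 2 * lam * alpha = 0."""
    return -(b2.T @ y) / (2.0 * lam)


def minimizer_gradient_descent(y, b2, lam, n_iter=200):
    """Gradient descent on alpha -> H; the Hessian is 2 * lam * identity."""
    alpha = np.zeros(b2.shape[1])
    step = 1.0 / (4.0 * lam)  # halves the distance to the minimizer each step
    for _ in range(n_iter):
        grad = b2.T @ y + 2.0 * lam * alpha
        alpha = alpha - step * grad
    return alpha
```

Since $\partial_\alpha f(t, x, \mu, 0) = 0$ for this cost, inequality (9) reduces to $|\hat\alpha| \le \lambda^{-1}|b_2|\,|y|$, which the closed form satisfies with room to spare.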

2.4. Stochastic Maximum Principle. Going back to the program (i)-(iii) outlined in Subsection 2.2, 
the first two steps therein consist in solving a standard minimization problem when the distributions 
$(\mu_t)_{0 \le t \le T}$ are frozen, and one could express the value function of the optimization problem (4) as 
the solution of the corresponding Hamilton-Jacobi-Bellman (HJB for short) equation. This is the 
keystone of the analytic approach to the MFG theory, the matching problem (iii) being resolved by 
coupling the HJB equation with a Kolmogorov equation intended to identify the $(\mu_t)_{0 \le t \le T}$ with the 
marginal distributions of the optimal state of the problem. 

Instead, the strategy we have in mind relies on a probabilistic description of the optimal states of 
the optimization problem (4) as provided by the so-called stochastic maximum principle. Indeed, 
the latter provides a necessary condition for the optimal states of the problem (4): under suitable 
conditions, the optimally controlled diffusion processes satisfy the forward dynamics in a characteristic 
FBSDE, referred to as the adjoint system of the stochastic optimization problem. Moreover, the 
stochastic maximum principle provides a sufficient condition since, under additional convexity 
conditions, the forward dynamics of any solution to the adjoint system are optimal. In what follows, we 
use the sufficiency condition for proving the existence of solutions to the limit problem (i)-(iii) stated 
in Subsection 2.2. In addition to (A.1-2) we will also assume: 

(A.3) The function $[0, T] \ni t \mapsto b_1(t, x, \mu)$ is affine in $x$, i.e. it has the form $[0, T] \ni t \mapsto b_0(t, \mu) + 
b_1(t)x$, where $b_0$ and $b_1$ are $\mathbb{R}^d$ and $\mathbb{R}^{d \times d}$ valued respectively, and are bounded on bounded subsets of 
their respective domains. In particular, $b$ reads 

(10)   $b(t, x, \mu, \alpha) = b_0(t, \mu) + b_1(t)x + b_2(t)\alpha.$ 

(A.4) The function $\mathbb{R}^d \times \mathcal{P}_2(\mathbb{R}^d) \ni (x, \mu) \mapsto g(x, \mu)$ is locally bounded. Moreover, for any $\mu \in 
\mathcal{P}_2(\mathbb{R}^d)$, the function $\mathbb{R}^d \ni x \mapsto g(x, \mu)$ is once continuously differentiable and convex and has a 
$c_L$-Lipschitz-continuous first order derivative. 

In order to make the paper self-contained, we state and briefly prove the form of the sufficiency 
part of the stochastic maximum principle as it applies to (ii) when the flow of measures $(\mu_t)_{0 \le t \le T}$ 
is frozen. Instead of the standard version given for example in Chapter IV of the textbook by Yong 
and Zhou [27], we shall use: 

Theorem 1. Under assumptions (A.1-4), if the mapping $[0, T] \ni t \mapsto \mu_t \in \mathcal{P}_2(\mathbb{R}^d)$ is measurable 
and bounded, and the cost functional $J$ is defined by 

(11)   $J(\beta; \mu) = \mathbb{E}\left[ g(U_T, \mu_T) + \int_0^T f(t, U_t, \mu_t, \beta_t)\,dt \right]$ 

for any progressively measurable process $\beta = (\beta_t)_{0 \le t \le T}$ satisfying the admissibility condition (2), 
where $U = (U_t)_{0 \le t \le T}$ is the corresponding controlled diffusion process 

$U_t = x_0 + \int_0^t b(s, U_s, \mu_s, \beta_s)\,ds + \sigma W_t, \quad t \in [0, T],$ 

for $x_0 \in \mathbb{R}^d$, and if the forward-backward system 

(12)   $dX_t = b(t, X_t, \mu_t, \hat\alpha(t, X_t, \mu_t, Y_t))\,dt + \sigma\,dW_t, \quad X_0 = x_0,$ 
       $dY_t = -\partial_x H(t, X_t, \mu_t, Y_t, \hat\alpha(t, X_t, \mu_t, Y_t))\,dt + Z_t\,dW_t, \quad Y_T = \partial_x g(X_T, \mu_T),$ 

has a solution $(X_t, Y_t, Z_t)_{0 \le t \le T}$ such that 

(13)   $\mathbb{E}\left[ \sup_{0 \le t \le T} \left( |X_t|^2 + |Y_t|^2 \right) + \int_0^T |Z_t|^2\,dt \right] < +\infty,$ 

and if we set $\hat\alpha_t = \hat\alpha(t, X_t, \mu_t, Y_t)$, then for any $\beta = (\beta_t)_{0 \le t \le T}$ satisfying (2), it holds 

$J(\hat\alpha; \mu) + \lambda\,\mathbb{E}\int_0^T |\beta_t - \hat\alpha_t|^2\,dt \le J(\beta; \mu).$ 

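Proof. By Lemma 1, $\hat\alpha = (\hat\alpha_t)_{0 \le t \le T}$ satisfies (2), and the standard proof of the stochastic maximum 
principle, see for example Theorem 6.4.6 in Pham [24], gives 

$J(\beta; \mu) \ge J(\hat\alpha; \mu) + \mathbb{E}\int_0^T \left[ H(t, U_t, \mu_t, Y_t, \beta_t) - H(t, X_t, \mu_t, Y_t, \hat\alpha_t) \right.$ 
$\qquad \left. - \langle U_t - X_t, \partial_x H(t, X_t, \mu_t, Y_t, \hat\alpha_t) \rangle - \langle \beta_t - \hat\alpha_t, \partial_\alpha H(t, X_t, \mu_t, Y_t, \hat\alpha_t) \rangle \right] dt.$ 

By linearity of $b$ and assumption (A.2) on $f$, the Hamiltonian $H$ satisfies the convexity inequality (8) 
in the variables $(x, \alpha)$, so that the required convexity assumption is satisfied. The result easily follows. □ 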

Remark 1. As the proof shows, the result of Theorem 1 above still holds if the control $\beta = (\beta_t)_{0 \le t \le T}$ 
is merely adapted to a larger filtration as long as the Wiener process $W = (W_t)_{0 \le t \le T}$ remains a 
Brownian motion for this filtration. 

Remark 2. Theorem 1 has interesting consequences. First, it says that the optimal control, if it exists, 
must be unique. Second, it also implies that, given two solutions $(X, Y, Z)$ and $(X', Y', Z')$ to (12), 
$d\mathbb{P} \otimes dt$ a.e. it holds 

$\hat\alpha(t, X_t, \mu_t, Y_t) = \hat\alpha(t, X'_t, \mu_t, Y'_t),$ 

so that $X$ and $X'$ coincide by the Lipschitz property of the coefficients of the forward equation. As a 
consequence, $(Y, Z)$ and $(Y', Z')$ coincide as well. 

It should be noticed that in some sense, the bound provided by Theorem 1 is sharp within the realm 
of convex models as shown for example by the following slight variation on the same theme. We shall 
use this form repeatedly in the proof of our main result. 

Proposition 1. Under the same assumptions and notation as in Theorem 1 above, if we consider in 
addition another measurable and bounded mapping $[0, T] \ni t \mapsto \mu'_t \in \mathcal{P}_2(\mathbb{R}^d)$ and the controlled 
diffusion process $U' = (U'_t)_{0 \le t \le T}$ defined by 

$U'_t = x'_0 + \int_0^t b(s, U'_s, \mu'_s, \beta_s)\,ds + \sigma W_t, \quad t \in [0, T],$ 

for an initial condition $x'_0 \in \mathbb{R}^d$ possibly different from $x_0$, then, 

(14)   $J(\hat\alpha; \mu) + \langle x'_0 - x_0, Y_0 \rangle + \lambda\,\mathbb{E}\int_0^T |\beta_t - \hat\alpha_t|^2\,dt$ 
       $\qquad \le J([\beta, \mu']; \mu) + \mathbb{E}\left[ \int_0^T \langle b_0(t, \mu'_t) - b_0(t, \mu_t), Y_t \rangle\,dt \right],$ 

where 

(15)   $J([\beta, \mu']; \mu) = \mathbb{E}\left[ g(U'_T, \mu_T) + \int_0^T f(t, U'_t, \mu_t, \beta_t)\,dt \right].$ 

The parameter $[\beta, \mu']$ in the cost $J([\beta, \mu']; \mu)$ indicates that the flow of measures in the drift of $U'$ 
is $(\mu'_t)_{0 \le t \le T}$ whereas the flow of measures in the cost functions is $(\mu_t)_{0 \le t \le T}$. In fact, we should 
also indicate that the initial condition $x'_0$ might be different from $x_0$, but we prefer not to do so since 
there is no risk of confusion in the sequel. Also, when $x'_0 = x_0$ and $\mu'_t = \mu_t$ for any $t \in [0, T]$, 
$J([\beta, \mu']; \mu) = J(\beta; \mu)$. 

Proof. The idea is to go back to the original proof of the stochastic maximum principle and, using 
Itô's formula, expand 

$\left( \langle U'_t - X_t, Y_t \rangle + \int_0^t \left[ f(s, U'_s, \mu_s, \beta_s) - f(s, X_s, \mu_s, \hat\alpha_s) \right] ds \right)_{0 \le t \le T}.$ 

Since the initial conditions $x_0$ and $x'_0$ are possibly different, we get the additional term $\langle x'_0 - x_0, Y_0 \rangle$ in 
the left hand side of (14). Similarly, since the drift of $U'$ is driven by $(\mu'_t)_{0 \le t \le T}$, we get the additional 
difference of the drifts in order to account for the fact that the drifts are driven by the different flows 
of probability measures. □ 

3. The Mean-Field FBSDE 

In order to solve the standard stochastic control problem (4) using the Pontryagin maximum principle, 
we minimize the Hamiltonian $H$ with respect to the control variable $\alpha$, and inject the minimizer 
$\hat\alpha$ into the forward equation of the state as well as the adjoint backward equation. Since the minimizer 
$\hat\alpha$ depends upon both the forward state $X_t$ and the adjoint process $Y_t$, this creates a strong coupling 
between the forward and backward equations leading to the FBSDE (12). The MFG matching condition 
(iii) of Subsection 2.2 then reads: seek a family of probability distributions $(\mu_t)_{0 \le t \le T}$ of order 
2 such that the process $X$ solving the forward equation of (12) admits $(\mu_t)_{0 \le t \le T}$ as flow of marginal 
distributions. 

In a nutshell, the probabilistic approach to the solution of the mean-field game problem results in 
the solution of a FBSDE of the McKean-Vlasov type 

(16)   $dX_t = b(t, X_t, \mathbb{P}_{X_t}, \hat\alpha(t, X_t, \mathbb{P}_{X_t}, Y_t))\,dt + \sigma\,dW_t,$ 
       $dY_t = -\partial_x H(t, X_t, \mathbb{P}_{X_t}, Y_t, \hat\alpha(t, X_t, \mathbb{P}_{X_t}, Y_t))\,dt + Z_t\,dW_t,$ 

with the initial condition $X_0 = x_0 \in \mathbb{R}^d$, and terminal condition $Y_T = \partial_x g(X_T, \mathbb{P}_{X_T})$. To the best of 
our knowledge, this type of FBSDE has not been considered in the existing literature. However, our 
experience with the classical theory of FBSDEs tells us that existence and uniqueness are expected 
to hold in short time when the coefficients driving (16) are Lipschitz-continuous in the variables $x$, 
$\alpha$ and $\mu$ from standard contraction arguments. This strategy can also be followed in the McKean-Vlasov 
setting, taking advantage of the Lipschitz regularity of the coefficients upon the parameter 
$\mu$ for the 2-Wasserstein distance, exactly as in the theory of McKean-Vlasov (forward) SDEs. See 
Sznitman [26]. However, the short time restriction is not really satisfactory for many reasons, and in 
particular for practical applications. Throughout the paper, all the regularity properties with respect to 
$\mu$ are understood in the sense of the 2-Wasserstein distance $W_2$. Whenever $E$ is a separable Banach 
space, for any $p \ge 1$, $\mu, \mu' \in \mathcal{P}_p(E)$, the distance $W_p(\mu, \mu')$ is defined by: 

$W_p(\mu, \mu') = \inf \left\{ \left[ \int_{E \times E} |x - y|_E^p \, \pi(dx, dy) \right]^{1/p} ; \ \pi \in \mathcal{P}_p(E \times E) \text{ with marginals } \mu \text{ and } \mu' \right\}.$ 
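
For intuition about $W_p$, here is a small sketch in the simplest possible setting, assumed for illustration only: $E = \mathbb{R}$ and two empirical measures with the same number of equally weighted atoms. In that case the infimum over couplings is attained by matching the sorted atoms, so no optimization is needed. The function name `wasserstein_1d` is hypothetical.

```python
import numpy as np


def wasserstein_1d(xs, ys, p=2):
    """W_p between two empirical measures on the real line.

    Assumes both measures have the same number of atoms with equal
    weights; the monotone (sorted) coupling is then optimal for p >= 1.
    """
    xs = np.sort(np.asarray(xs, dtype=float))
    ys = np.sort(np.asarray(ys, dtype=float))
    if xs.shape != ys.shape:
        raise ValueError("both samples must have the same number of atoms")
    return float(np.mean(np.abs(xs - ys) ** p) ** (1.0 / p))
```

For instance, two point masses at distance 3 from each other are at $W_p$-distance 3 for every $p$, and relabeling the atoms of a sample does not change the distance.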

Below, we develop an alternative approach and prove existence of a solution over arbitrarily prescribed 
time duration $T$. The crux of the proof is to take advantage of the convexity of the coefficients. 
Indeed, in optimization theory, convexity often leads to compactness. Our objective is then 
to take advantage of this compactness in order to solve the matching problem (iii) in (4) by applying 
Schauder's fixed point theorem in an appropriate space of finite measures on $C([0, T]; \mathbb{R}^d)$. 

For the sake of convenience, we restate the general FBSDE (16) of McKean-Vlasov type in the 
special set-up of the present paper. It reads: 

(17)   $dX_t = \left[ b_0(t, \mathbb{P}_{X_t}) + b_1(t)X_t + b_2(t)\hat\alpha(t, X_t, \mathbb{P}_{X_t}, Y_t) \right] dt + \sigma\,dW_t,$ 
       $dY_t = -\left[ b_1^\dagger(t)Y_t + \partial_x f(t, X_t, \mathbb{P}_{X_t}, \hat\alpha(t, X_t, \mathbb{P}_{X_t}, Y_t)) \right] dt + Z_t\,dW_t,$ 

where $a^\dagger$ denotes the transpose of the matrix $a$. 

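3.1. Standing Assumptions and Main Result. In addition to (A.1-4), we shall rely on the following 
assumptions. 

(A.5) The functions $[0, T] \ni t \mapsto f(t, 0, \delta_0, 0)$, $[0, T] \ni t \mapsto \partial_x f(t, 0, \delta_0, 0)$ and $[0, T] \ni t \mapsto 
\partial_\alpha f(t, 0, \delta_0, 0)$ are bounded by $c_L$, and, for all $t \in [0, T]$, $x, x' \in \mathbb{R}^d$, $\alpha, \alpha' \in \mathbb{R}^k$ and $\mu, \mu' \in \mathcal{P}_2(\mathbb{R}^d)$, 
it holds: 

$|(f, g)(t, x', \mu', \alpha') - (f, g)(t, x, \mu, \alpha)|$ 
$\qquad \le c_L \left[ 1 + |(x', \alpha')| + |(x, \alpha)| + M_2(\mu) + M_2(\mu') \right] \left[ |(x', \alpha') - (x, \alpha)| + W_2(\mu', \mu) \right].$ 

Moreover, $b_0$, $b_1$ and $b_2$ in (10) are bounded by $c_L$ and $b_0$ satisfies for any $\mu, \mu' \in \mathcal{P}_2(\mathbb{R}^d)$: 
$|b_0(t, \mu') - b_0(t, \mu)| \le c_L W_2(\mu, \mu')$. 

(A.6) For all $t \in [0, T]$, $x \in \mathbb{R}^d$ and $\mu \in \mathcal{P}_2(\mathbb{R}^d)$, $|\partial_\alpha f(t, x, \mu, 0)| \le c_L$. 

(A.7) For all $(t, x) \in [0, T] \times \mathbb{R}^d$, $\langle x, \partial_x f(t, 0, \delta_x, 0) \rangle \ge -c_L(1 + |x|)$, $\langle x, \partial_x g(0, \delta_x) \rangle \ge -c_L(1 + |x|)$. 
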

Theorem 2. Under (A.1-7), the forward-backward system (16) has a solution. Moreover, for any 
solution $(X_t, Y_t, Z_t)_{0 \le t \le T}$ to (16), there exists a function $u : [0, T] \times \mathbb{R}^d \to \mathbb{R}^d$ (referred to as the 
FBSDE value function), satisfying the growth and Lipschitz properties 

(18)   $\forall t \in [0, T], \ \forall x, x' \in \mathbb{R}^d, \quad |u(t, x)| \le c(1 + |x|), \quad |u(t, x) - u(t, x')| \le c|x - x'|,$ 

for some constant $c > 0$, and such that, $\mathbb{P}$-a.s., for all $t \in [0, T]$, $Y_t = u(t, X_t)$. In particular, for any 
$\ell \ge 1$, $\mathbb{E}[\sup_{0 \le t \le T} |X_t|^\ell] < +\infty$. 

(A.5) provides Lipschitz continuity while condition (A.6) controls the smoothness of the running 
cost $f$ with respect to $\alpha$ uniformly in the other variables. The most unusual assumption is certainly 
condition (A.7). We refer to it as a weak mean-reverting condition as it looks like a standard 
mean-reverting condition for recurrent diffusion processes. Moreover, as shown by the proof of Theorem 2, 
its role is to control the expectation of the forward equation in (16) and to establish an a priori bound 
for it. This is of crucial importance in order to make the compactness strategy effective. We use the 
terminology weak as it is not expected to converge with time. 

Remark 3. An interesting example which we should keep in mind is the so-called linear-quadratic 
model in which $b_0$, $f$ and $g$ have the form: 

$b_0(t, \mu) = b_0(t)\bar\mu, \quad g(x, \mu) = \frac{1}{2}|qx + \bar q \bar\mu|^2, \quad f(t, x, \mu, \alpha) = \frac{1}{2}|m(t)x + \bar m(t)\bar\mu|^2 + \frac{1}{2}|n(t)\alpha|^2,$ 

where $q$, $\bar q$, $m(t)$ and $\bar m(t)$ are elements of $\mathbb{R}^{d \times d}$, $n(t)$ is an element of $\mathbb{R}^{k \times k}$ and $\bar\mu$ stands for the 
mean of $\mu$. In this framework, (A.7) says that $\bar q^\dagger q \ge 0$ and $\bar m(t)^\dagger m(t) \ge 0$ in the sense of quadratic 
forms. In the one-dimensional case $d = 1$, (A.7) says that $q\bar q$ and $m(t)\bar m(t)$ must be non-negative. 
As shown in Q, this condition is not optimal for existence, as the conditions $q(q + \bar q) \ge 0$ and 
$m(t)(m(t) + \bar m(t)) \ge 0$ are sufficient to guarantee the solvability of (16). Obviously, the gap 
between these conditions is the price to pay for treating general systems within a single framework. 
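
The gap between the two conditions of Remark 3 is easy to see numerically. In the linear-quadratic model, $\partial_x g(0, \delta_x) = q^\dagger \bar q\, x$, so the part of (A.7) concerning $g$ holds exactly when the quadratic form $x \mapsto \langle x, q^\dagger \bar q\, x \rangle$ is non-negative. The sketch below tests this through the smallest eigenvalue of the symmetric part; the function name and the tolerance `tol` are assumptions made for illustration.

```python
import numpy as np


def weak_mean_reverting_lq(q, q_bar, tol=1e-12):
    """Check the terminal-cost part of (A.7) in the LQ model of Remark 3.

    The condition <x, q^T q_bar x> >= 0 for all x only involves the
    symmetric part of q^T q_bar, so we test its smallest eigenvalue.
    """
    form = q.T @ q_bar
    symmetric_part = 0.5 * (form + form.T)
    return bool(np.linalg.eigvalsh(symmetric_part).min() >= -tol)
```

In dimension one, taking $q = 1$ and $\bar q = -1/2$ gives $q\bar q < 0$, so (A.7) fails, while $q(q + \bar q) = 1/2 \ge 0$, so the weaker condition quoted in the remark still holds.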

3.2. Rigorous Definition of the Matching Problem. The proof of Theorem 2 is split into four main 
steps. The first one consists in making the statement of the matching problem (iii) in (4) rigorous. To 
this end, we need the following 

Lemma 2. Given $\mu \in \mathcal{P}_2(C([0, T]; \mathbb{R}^d))$ with marginal distributions $(\mu_t)_{0 \le t \le T}$, the FBSDE (12) is 
uniquely solvable. If we denote its solution by $(X_t^{x_0;\mu}, Y_t^{x_0;\mu}, Z_t^{x_0;\mu})_{0 \le t \le T}$, then there exist a constant 
$c > 0$, only depending upon the parameters of (A.1-7), and a locally bounded measurable function 
$u^\mu : [0, T] \times \mathbb{R}^d \to \mathbb{R}^d$ such that 

$\forall x, x' \in \mathbb{R}^d, \quad |u^\mu(t, x') - u^\mu(t, x)| \le c|x' - x|,$ 

and $\mathbb{P}$-a.s., for all $t \in [0, T]$, $Y_t^{x_0;\mu} = u^\mu(t, X_t^{x_0;\mu})$. 

Proof. We know that $\partial_x H$ reads $\partial_x H(t, x, \mu, y, \alpha) = b_1^\dagger(t)y + \partial_x f(t, x, \mu, \alpha)$, so that, by Lemma 1, 
the driver $[0, T] \times \mathbb{R}^d \times \mathbb{R}^d \ni (t, x, y) \mapsto \partial_x H(t, x, \mu_t, y, \hat\alpha(t, x, \mu_t, y))$ of the backward equation 
in (12) is Lipschitz continuous in the variables $(x, y)$, uniformly in $t$. Therefore, by standard results 
in FBSDE theory, existence and uniqueness hold when $T$ is small enough. Equivalently, when $T$ is 
arbitrary, there exists $\delta > 0$, depending on the Lipschitz constant of the coefficients in the variables 
$x$ and $y$ such that unique solvability holds on $[T - \delta, T]$, that is when the initial condition $x_0$ of 
the forward process is prescribed at some time $t_0 \in [T - \delta, T]$. The solution is then denoted by 
$(X_t^{t_0,x_0}, Y_t^{t_0,x_0}, Z_t^{t_0,x_0})_{t_0 \le t \le T}$. Following Delarue JH, existence and uniqueness hold on the whole 
$[0, T]$, provided 

(19)   $\forall x_0, x'_0 \in \mathbb{R}^d, \quad |Y_{t_0}^{t_0,x_0} - Y_{t_0}^{t_0,x'_0}|^2 \le c|x_0 - x'_0|^2,$ 

for some constant $c$ independent of $t_0$ and $\delta$. Notice that, by Blumenthal's Zero-One Law, the random 
variables $Y_{t_0}^{t_0,x_0}$ and $Y_{t_0}^{t_0,x'_0}$ are deterministic. By (14), we have 

(20)   $\hat J^{t_0,x_0} + \langle x'_0 - x_0, Y_{t_0}^{t_0,x_0} \rangle + \lambda\,\mathbb{E}\int_{t_0}^T |\hat\alpha_t^{t_0,x_0} - \hat\alpha_t^{t_0,x'_0}|^2\,dt \le \hat J^{t_0,x'_0},$ 

where $\hat J^{t_0,x_0} = J((\hat\alpha_t^{t_0,x_0})_{t_0 \le t \le T}; \mu)$ and $\hat\alpha_t^{t_0,x_0} = \hat\alpha(t, X_t^{t_0,x_0}, \mu_t, Y_t^{t_0,x_0})$ (with similar definitions 
for $\hat J^{t_0,x'_0}$ and $\hat\alpha^{t_0,x'_0}$ by replacing $x_0$ by $x'_0$). Exchanging the roles of $x_0$ and $x'_0$ and adding the 
resulting inequality with (20), we deduce that 

(21)   $2\lambda\,\mathbb{E}\int_{t_0}^T |\hat\alpha_t^{t_0,x_0} - \hat\alpha_t^{t_0,x'_0}|^2\,dt \le \langle x'_0 - x_0, Y_{t_0}^{t_0,x'_0} - Y_{t_0}^{t_0,x_0} \rangle.$ 

Moreover, by standard SDE estimates first and then by standard BSDE estimates, there exists a constant 
$c$ (the value of which may vary from line to line), independent of $t_0$ and $\delta$, such that 

$\mathbb{E}\left[ \sup_{t_0 \le t \le T} |X_t^{t_0,x_0} - X_t^{t_0,x'_0}|^2 \right] + \mathbb{E}\left[ \sup_{t_0 \le t \le T} |Y_t^{t_0,x_0} - Y_t^{t_0,x'_0}|^2 \right] \le c\,\mathbb{E}\int_{t_0}^T |\hat\alpha_t^{t_0,x_0} - \hat\alpha_t^{t_0,x'_0}|^2\,dt.$ 

Plugging (21) into the above inequality completes the proof of (19). 

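The function $u^\mu$ is then defined as $u^\mu : [0, T] \times \mathbb{R}^d \ni (t, x) \mapsto Y_t^{t,x}$. The representation property 
of $Y$ in terms of $X$ directly follows from [8]. Local boundedness of $u^\mu$ follows from the Lipschitz 
continuity in the variable $x$ together with the obvious inequality: 

$\sup_{0 \le t \le T} |u^\mu(t, 0)| \le \sup_{0 \le t \le T} \left[ \mathbb{E}\left[ |u^\mu(t, X_t^{0,0}) - u^\mu(t, 0)| \right] + \mathbb{E}\left[ |Y_t^{0,0}| \right] \right] < +\infty.$   □ 
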

We now set 

Definition 1. To each $\mu \in \mathcal{P}_2(C([0, T]; \mathbb{R}^d))$ with marginal distributions $(\mu_t)_{0 \le t \le T}$, we associate the 
measure $\mathbb{P}_{X^{x_0;\mu}}$ where $X^{x_0;\mu}$ is the solution of (12) with initial condition $x_0$. The resulting mapping 
$\mathcal{P}_2(C([0, T]; \mathbb{R}^d)) \ni \mu \mapsto \mathbb{P}_{X^{x_0;\mu}} \in \mathcal{P}_2(C([0, T]; \mathbb{R}^d))$ is denoted by $\Phi$ and we call solution of the 
matching problem (iii) in (4) any fixed point $\mu$ of $\Phi$. For such a fixed point $\mu$, $X^{x_0;\mu}$ satisfies (16). 

Definition 1 captures the essence of the approach of Lasry and Lions, who freeze the probability
measure at the optimal value when optimizing the cost. This is not the case in the study of the control
of McKean-Vlasov dynamics, as investigated in [6]: in this different setting, optimization is also
performed with respect to the measure argument. See also [7] and [2] for the linear quadratic case.

3.3. Existence under Additional Boundedness Conditions. We first prove existence under an extra 
boundedness assumption. 



Proposition 2. The system (16) is solvable if, in addition to (A.1-7), we also assume that $\partial_x f$ and
$\partial_x g$ are uniformly bounded, i.e. for some constant $c_B > 0$,

(22)   $\forall t \in [0,T],\ x \in \mathbb{R}^d,\ \mu \in \mathcal{P}_2(\mathbb{R}^d),\ \alpha \in \mathbb{R}^k, \quad |\partial_x g(x,\mu)|,\ |\partial_x f(t,x,\mu,\alpha)| \le c_B.$

Notice that (22) implies (A.7).

Proof. We apply Schauder's fixed point theorem in the space $\mathcal{M}_1(C([0,T];\mathbb{R}^d))$ of finite signed
measures $\nu$ of order 1 on $C([0,T];\mathbb{R}^d)$ endowed with the Kantorovich-Rubinstein norm:

$\|\nu\|_{KR} = \sup\Bigl\{ \Bigl| \int_{C([0,T];\mathbb{R}^d)} F(w)\,d\nu(w) \Bigr| ;\ F \in \mathrm{Lip}_1(C([0,T];\mathbb{R}^d)) \Bigr\},$

for $\nu \in \mathcal{M}_1(C([0,T];\mathbb{R}^d))$, which is known to coincide with the Wasserstein distance $W_1$ on $\mathcal{P}_1(C([0,T];\mathbb{R}^d))$.
In what follows, we prove existence by proving that there exists a closed convex subset $\mathcal{E} \subset \mathcal{P}_2(C([0,T];\mathbb{R}^d)) \subset
\mathcal{M}_1(C([0,T];\mathbb{R}^d))$ which is stable for $\Phi$, with a relatively compact range, $\Phi$ being continuous on $\mathcal{E}$.

First Step. We first establish several a priori estimates for the solution of (12). The coefficients
$\partial_x f$ and $\partial_x g$ being bounded, the terminal condition in (12) is bounded and the growth of the driver is
of the form:

$|\partial_x H(t,x,\mu_t,y,\hat\alpha(t,x,\mu_t,y))| \le c_B + c_L |y|.$

By standard BSDE estimates relying on Gronwall's lemma, this implies that there exists a constant $c$,
only depending upon $c_B$, $c_L$ and $T$, such that, for any $\mu \in \mathcal{P}_2(C([0,T];\mathbb{R}^d))$,

(23)   $\forall t \in [0,T], \quad |Y_t^{x_0;\mu}| \le c$

holds $\mathbb{P}$-almost surely. By (9) in the proof of Lemma 1 and by (A.6), we deduce that (the value of $c$
possibly varying from line to line)

(24)   $\forall t \in [0,T], \quad |\hat\alpha(t, X_t^{x_0;\mu}, \mu_t, Y_t^{x_0;\mu})| \le c.$

Plugging this bound into the forward part of (12), standard $L^p$ estimates for SDEs imply that there
exists a constant $c'$, only depending upon $c_B$, $c_L$ and $T$, such that

(25)   $\mathbb{E}\bigl[\sup_{0 \le t \le T} |X_t^{x_0;\mu}|^4\bigr] \le c'.$

We consider the restriction of $\Phi$ to the subset $\mathcal{E}$ of probability measures of order 4 whose fourth
moment is not greater than $c'$, i.e.

$\mathcal{E} = \{\mu \in \mathcal{P}_4(C([0,T];\mathbb{R}^d)) : M_{4,C([0,T];\mathbb{R}^d)}(\mu) \le c'\}.$

The set $\mathcal{E}$ is convex and closed for the 1-Wasserstein distance, and $\Phi$ maps $\mathcal{E}$ into itself.

Second Step. The family of processes $((X_t^{x_0;\mu})_{0 \le t \le T})_{\mu \in \mathcal{E}}$ is tight in $C([0,T];\mathbb{R}^d)$, as a consequence
of (24) and (25). By (25) again, $\Phi(\mathcal{E})$ is actually relatively compact for the 1-Wasserstein
distance on $C([0,T];\mathbb{R}^d)$. Indeed, tightness says that it is relatively compact for the topology of weak
convergence of measures and (25) says that any weakly convergent sequence $(\mathbb{P}_{X^{x_0;\mu^n}})_{n \ge 1}$, with
$\mu^n \in \mathcal{E}$ for any $n \ge 1$, is convergent for the 1-Wasserstein distance.

Third Step. We finally check that $\Phi$ is continuous on $\mathcal{E}$. Given another measure $\mu' \in \mathcal{E}$, we deduce
from (14) in Proposition 1 that:

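(26)   $J(\hat\alpha;\mu) + \lambda\,\mathbb{E}\int_0^T |\hat\alpha'_t - \hat\alpha_t|^2\,dt \le J([\hat\alpha',\mu'];\mu) + \mathbb{E}\int_0^T \langle b_0(t,\mu'_t) - b_0(t,\mu_t), Y_t \rangle\,dt,$

where $\hat\alpha_t = \hat\alpha(t, X_t^{x_0;\mu}, \mu_t, Y_t^{x_0;\mu})$, for $t \in [0,T]$, with a similar definition for $\hat\alpha'_t$ by replacing $\mu$ by
$\mu'$. By optimality of $\hat\alpha'$ for the cost functional $J(\cdot;\mu')$, we claim:

$J([\hat\alpha',\mu'];\mu) \le J(\hat\alpha;\mu') + J([\hat\alpha',\mu'];\mu) - J(\hat\alpha';\mu'),$

so that (26) yields

(27)   $\lambda\,\mathbb{E}\int_0^T |\hat\alpha'_t - \hat\alpha_t|^2\,dt \le J(\hat\alpha;\mu') - J(\hat\alpha;\mu) + J([\hat\alpha',\mu'];\mu) - J(\hat\alpha';\mu') + \mathbb{E}\int_0^T \langle b_0(t,\mu'_t) - b_0(t,\mu_t), Y_t \rangle\,dt.$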

We now compare $J(\hat\alpha;\mu')$ with $J(\hat\alpha;\mu)$ (and similarly $J(\hat\alpha';\mu')$ with $J([\hat\alpha',\mu'];\mu)$). We notice that
$J(\hat\alpha;\mu)$ is the cost associated with the flow of measures $(\mu_t)_{0 \le t \le T}$ and the diffusion process $X^{x_0;\mu}$,
whereas $J(\hat\alpha;\mu')$ is the cost associated with the flow of measures $(\mu'_t)_{0 \le t \le T}$ and the controlled diffusion
process $U$ satisfying

$dU_t = [b_0(t,\mu'_t) + b_1(t) U_t + b_2(t) \hat\alpha_t]\,dt + \sigma\,dW_t, \quad t \in [0,T]; \quad U_0 = x_0.$

By Gronwall's lemma, there exists a constant $c$ such that

$\mathbb{E}\bigl[\sup_{0 \le t \le T} |X_t^{x_0;\mu} - U_t|^2\bigr] \le c \int_0^T W_2^2(\mu_t,\mu'_t)\,dt.$



Since $\mu$ and $\mu'$ are in $\mathcal{E}$, we deduce from (A.5), (24) and (25) that

$J(\hat\alpha;\mu') - J(\hat\alpha;\mu) \le c \Bigl( \int_0^T W_2^2(\mu_t,\mu'_t)\,dt \Bigr)^{1/2},$

with a similar bound for $J([\hat\alpha',\mu'];\mu) - J(\hat\alpha';\mu')$ (the argument is even simpler as the costs are driven
by the same processes), so that, from (27) and (23) again, together with Gronwall's lemma to go back
to the controlled SDEs,

$\mathbb{E}\int_0^T |\hat\alpha'_t - \hat\alpha_t|^2\,dt + \mathbb{E}\bigl[\sup_{0 \le t \le T} |X_t^{x_0;\mu} - X_t^{x_0;\mu'}|^2\bigr] \le c \Bigl( \int_0^T W_2^2(\mu_t,\mu'_t)\,dt \Bigr)^{1/2}.$

As probability measures in $\mathcal{E}$ have bounded moments of order 4, the Cauchy-Schwarz inequality yields
(keep in mind that $W_1(\Phi(\mu),\Phi(\mu')) \le \mathbb{E}[\sup_{0 \le t \le T} |X_t^{x_0;\mu} - X_t^{x_0;\mu'}|]$):

$W_1(\Phi(\mu),\Phi(\mu')) \le c \Bigl( \int_0^T W_2^2(\mu_t,\mu'_t)\,dt \Bigr)^{1/4} \le c \Bigl( \int_0^T W_1^{1/2}(\mu_t,\mu'_t)\,dt \Bigr)^{1/4},$

which shows that $\Phi$ is continuous on $\mathcal{E}$ with respect to the 1-Wasserstein distance $W_1$ on $\mathcal{P}_1(C([0,T];\mathbb{R}^d))$.

□ 
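The last step of the proof compares $W_2^2$ with $W_1^{1/2}$ on measures with bounded fourth moments. The display below is a sketch of how such a bound can be obtained; the random variables $\xi$, $\xi'$ (an optimal coupling of $\mu_t$ and $\mu'_t$ for $W_1$) are introduced here for illustration and do not appear in the original text.

```latex
% Assumption of this sketch: (\xi, \xi') is a coupling of (\mu_t, \mu'_t)
% with E|\xi - \xi'| = W_1(\mu_t, \mu'_t), and E|\xi|^4, E|\xi'|^4 \le c'.
\begin{align*}
W_2^2(\mu_t,\mu'_t)
  &\le \mathbb{E}\bigl[|\xi-\xi'|^2\bigr]
   = \mathbb{E}\bigl[|\xi-\xi'|^{1/2}\,|\xi-\xi'|^{3/2}\bigr] \\
  &\le \mathbb{E}\bigl[|\xi-\xi'|\bigr]^{1/2}\,
       \mathbb{E}\bigl[|\xi-\xi'|^{3}\bigr]^{1/2}
   && \text{(Cauchy-Schwarz)} \\
  &\le W_1(\mu_t,\mu'_t)^{1/2}\,
       \bigl(4\,\mathbb{E}[|\xi|^3] + 4\,\mathbb{E}[|\xi'|^3]\bigr)^{1/2}
   && \text{(since } (a+b)^3 \le 4(a^3+b^3)\text{)} \\
  &\le \bigl(8\,(c')^{3/4}\bigr)^{1/2}\, W_1(\mu_t,\mu'_t)^{1/2}
   && \text{(Jensen: } \mathbb{E}|\xi|^3 \le (\mathbb{E}|\xi|^4)^{3/4}\text{)} .
\end{align*}
```

Only third moments are actually needed in this computation; the fourth-moment bound (25) is more than enough.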

3.4. Approximation Procedure. Examples of functions $f$ and $g$ which are convex in $x$ and such that
$\partial_x f$ and $\partial_x g$ are bounded are rather limited in number and scope. For instance, boundedness of $\partial_x f$
and $\partial_x g$ fails in the typical case when $f$ and $g$ are quadratic with respect to $x$. In order to overcome
this limitation, we propose to approximate the cost functions $f$ and $g$ by two sequences $(f^n)_{n \ge 1}$
and $(g^n)_{n \ge 1}$, referred to as approximated cost functions, satisfying (A.1-7) uniformly with respect
to $n \ge 1$, and such that, for any $n \ge 1$, equation (16), with $(\partial_x f, \partial_x g)$ replaced by $(\partial_x f^n, \partial_x g^n)$,
has a solution $(X^n, Y^n, Z^n)$. In this framework, Proposition 2 says that such approximated FBSDEs
are indeed solvable when $\partial_x f^n$ and $\partial_x g^n$ are bounded for any $n \ge 1$. Our approximation procedure
relies on the following:

Lemma 3. If there exist two sequences $(f^n)_{n \ge 1}$ and $(g^n)_{n \ge 1}$ such that

(i) there exist two parameters $c'_L$ and $\lambda' > 0$ such that, for any $n \ge 1$, $f^n$ and $g^n$ satisfy (A.1-7)
with respect to $\lambda'$ and $c'_L$;

(ii) $f^n$ (resp. $g^n$) converges towards $f$ (resp. $g$) uniformly on any bounded subset of $[0,T] \times \mathbb{R}^d \times
\mathcal{P}_2(\mathbb{R}^d) \times \mathbb{R}^k$ (resp. $\mathbb{R}^d \times \mathcal{P}_2(\mathbb{R}^d)$);

(iii) for any $n \ge 1$, equation (16), with $(\partial_x f, \partial_x g)$ replaced by $(\partial_x f^n, \partial_x g^n)$, has a solution which
we denote by $(X^n, Y^n, Z^n)$.
Then, equation (16) is solvable.

Proof. We establish tightness of the processes $(X^n)_{n \ge 1}$ in order to extract a convergent subsequence.
For any $n \ge 1$, we consider the approximated Hamiltonian

$H^n(t,x,\mu,y,\alpha) = \langle b(t,x,\mu,\alpha), y \rangle + f^n(t,x,\mu,\alpha),$

together with its minimizer $\hat\alpha^n(t,x,\mu,y) = \mathrm{argmin}_\alpha H^n(t,x,\mu,y,\alpha)$. Setting $\hat\alpha^n_t = \hat\alpha^n(t, X^n_t, \mathbb{P}_{X^n_t}, Y^n_t)$
for any $t \in [0,T]$ and $n \ge 1$, our first step will be to prove that

(28)   $\sup_{n \ge 1} \mathbb{E}\int_0^T |\hat\alpha^n_s|^2\,ds < +\infty.$



Since $X^n$ is the diffusion process controlled by $(\hat\alpha^n_t)_{0 \le t \le T}$, we use Theorem 1 to compare its behavior
to the behavior of a reference controlled process $U^n$ whose dynamics are driven by a specific control
$\beta^n$. We shall consider two different versions for $U^n$ corresponding to the following choices for $\beta^n$:

(29)   (i) $\beta^n_s = \mathbb{E}(\hat\alpha^n_s)$ for $0 \le s \le T$; (ii) $\beta^n \equiv 0$.

For each of these controls, we compare the cost to the optimal cost by using the version of the
stochastic maximum principle which we proved earlier, and subsequently, derive useful information
on the optimal control $(\hat\alpha^n_s)_{0 \le s \le T}$.

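First Step. We first consider (i) in (29). In this case

(30)   $U^n_t = x_0 + \int_0^t [b_0(s,\mathbb{P}_{X^n_s}) + b_1(s) U^n_s + b_2(s) \mathbb{E}(\hat\alpha^n_s)]\,ds + \sigma W_t, \quad t \in [0,T].$

Notice that taking expectations on both sides of (30) shows that $\mathbb{E}(U^n_s) = \mathbb{E}(X^n_s)$, for $0 \le s \le T$,
and that

$[U^n_t - \mathbb{E}(U^n_t)] = \int_0^t b_1(s) [U^n_s - \mathbb{E}(U^n_s)]\,ds + \sigma W_t, \quad t \in [0,T],$

from which it easily follows that $\sup_{n \ge 1} \sup_{0 \le s \le T} \mathrm{Var}(U^n_s) < +\infty$.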

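By Theorem 1, with $g^n(\cdot,\mathbb{P}_{X^n_T})$ as terminal cost and $(f^n(t,\cdot,\mathbb{P}_{X^n_t},\cdot))_{0 \le t \le T}$ as running cost, we
get

(31)   $\mathbb{E}[g^n(X^n_T,\mathbb{P}_{X^n_T})] + \mathbb{E}\int_0^T [\lambda' |\hat\alpha^n_s - \beta^n_s|^2 + f^n(s, X^n_s, \mathbb{P}_{X^n_s}, \hat\alpha^n_s)]\,ds \le \mathbb{E}\Bigl[ g^n(U^n_T,\mathbb{P}_{X^n_T}) + \int_0^T f^n(s, U^n_s, \mathbb{P}_{X^n_s}, \beta^n_s)\,ds \Bigr].$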

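Using the fact that $\beta^n_s = \mathbb{E}(\hat\alpha^n_s)$, the convexity condition in (A.2,4) and Jensen's inequality, we obtain:

(32)   $g^n(\mathbb{E}(X^n_T),\mathbb{P}_{X^n_T}) + \int_0^T [\lambda' \mathrm{Var}(\hat\alpha^n_s) + f^n(s, \mathbb{E}(X^n_s), \mathbb{P}_{X^n_s}, \mathbb{E}(\hat\alpha^n_s))]\,ds \le \mathbb{E}\Bigl[ g^n(U^n_T,\mathbb{P}_{X^n_T}) + \int_0^T f^n(s, U^n_s, \mathbb{P}_{X^n_s}, \mathbb{E}(\hat\alpha^n_s))\,ds \Bigr].$

By (A.5), we deduce that there exists a constant $c$, depending only on $\lambda$, $c_L$, $x_0$ and $T$, such that (the
actual value of $c$ possibly varying from line to line)

$$\begin{aligned}
\int_0^T \mathrm{Var}(\hat\alpha^n_s)\,ds &\le c \bigl( 1 + \mathbb{E}[|U^n_T|^2]^{1/2} + \mathbb{E}[|X^n_T|^2]^{1/2} \bigr) \mathbb{E}[|U^n_T - \mathbb{E}(X^n_T)|^2]^{1/2} \\
&\quad + c \int_0^T \bigl( 1 + \mathbb{E}[|U^n_s|^2]^{1/2} + \mathbb{E}[|X^n_s|^2]^{1/2} + \mathbb{E}[|\hat\alpha^n_s|^2]^{1/2} \bigr) \mathbb{E}[|U^n_s - \mathbb{E}(X^n_s)|^2]^{1/2}\,ds.
\end{aligned}$$

Since $\mathbb{E}(X^n_t) = \mathbb{E}(U^n_t)$ for any $t \in [0,T]$, we deduce from the uniform boundedness of the variance
of $(U^n_s)_{0 \le s \le T}$ that

(33)   $\int_0^T \mathrm{Var}(\hat\alpha^n_s)\,ds \le c \Bigl[ 1 + \sup_{0 \le s \le T} \mathbb{E}[|X^n_s|^2]^{1/2} + \Bigl( \mathbb{E}\int_0^T |\hat\alpha^n_s|^2\,ds \Bigr)^{1/2} \Bigr].$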

From this, the linearity of the dynamics of $X^n$ and Gronwall's inequality, we deduce:

(34)   $\sup_{0 \le s \le T} \mathrm{Var}(X^n_s) \le c \Bigl[ 1 + \Bigl( \mathbb{E}\int_0^T |\hat\alpha^n_s|^2\,ds \Bigr)^{1/2} \Bigr],$

since

(35)   $\sup_{0 \le s \le T} \mathbb{E}[|X^n_s|^2] \le c \Bigl[ 1 + \mathbb{E}\int_0^T |\hat\alpha^n_s|^2\,ds \Bigr].$

Bounds like (34) allow us to control, for any $0 \le s \le T$, the Wasserstein distance between the
distribution of $X^n_s$ and the Dirac mass at the point $\mathbb{E}(X^n_s)$.

Second Step. We now compare $X^n$ to the process controlled by the null control. So we consider
case (ii) in (29), and now

$U^n_t = x_0 + \int_0^t [b_0(s,\mathbb{P}_{X^n_s}) + b_1(s) U^n_s]\,ds + \sigma W_t, \quad t \in [0,T].$

Since no confusion is possible, we still denote the solution by $U^n$ although it is different from the
one in the first step. By the boundedness of $b_0$ in (A.5), it holds $\sup_{n \ge 1} \mathbb{E}[\sup_{0 \le s \le T} |U^n_s|^2] < +\infty$.
Using Theorem 1 as before in the derivation of (31) and (32), we get

$g^n(\mathbb{E}(X^n_T),\mathbb{P}_{X^n_T}) + \int_0^T [\lambda' \mathbb{E}(|\hat\alpha^n_s|^2) + f^n(s, \mathbb{E}(X^n_s), \mathbb{P}_{X^n_s}, \mathbb{E}(\hat\alpha^n_s))]\,ds \le \mathbb{E}\Bigl[ g^n(U^n_T,\mathbb{P}_{X^n_T}) + \int_0^T f^n(s, U^n_s, \mathbb{P}_{X^n_s}, 0)\,ds \Bigr].$


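By convexity of $f^n$ with respect to $\alpha$ (see (A.2)) together with (A.6), we have

$g^n(\mathbb{E}(X^n_T),\delta_{\mathbb{E}(X^n_T)}) + \int_0^T [\lambda' \mathbb{E}(|\hat\alpha^n_s|^2) + f^n(s, \mathbb{E}(X^n_s), \mathbb{P}_{X^n_s}, 0)]\,ds \le \mathbb{E}\Bigl[ g^n(U^n_T,\mathbb{P}_{X^n_T}) + \int_0^T f^n(s, U^n_s, \mathbb{P}_{X^n_s}, 0)\,ds \Bigr] + c\,\mathbb{E}\int_0^T |\hat\alpha^n_s|\,ds,$

for some constant $c$, independent of $n$. Using (A.5) again, we obtain:

$$\begin{aligned}
&g^n(\mathbb{E}(X^n_T),\delta_{\mathbb{E}(X^n_T)}) + \int_0^T [\lambda' \mathbb{E}(|\hat\alpha^n_s|^2) + f^n(s, \mathbb{E}(X^n_s), \delta_{\mathbb{E}(X^n_s)}, 0)]\,ds \\
&\quad \le g^n(0,\delta_{\mathbb{E}(X^n_T)}) + \int_0^T f^n(s, 0, \delta_{\mathbb{E}(X^n_s)}, 0)\,ds + c\,\mathbb{E}\int_0^T |\hat\alpha^n_s|\,ds \\
&\qquad + c \bigl( 1 + \sup_{0 \le s \le T} [\mathbb{E}[|X^n_s|^2]^{1/2}] \bigr) \bigl( 1 + \sup_{0 \le s \le T} [\mathrm{Var}(X^n_s)]^{1/2} \bigr),
\end{aligned}$$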



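the value of $c$ possibly varying from line to line. From (35), Young's inequality yields

$g^n(\mathbb{E}(X^n_T),\delta_{\mathbb{E}(X^n_T)}) + \int_0^T \Bigl[ \frac{\lambda'}{2} \mathbb{E}(|\hat\alpha^n_s|^2) + f^n(s, \mathbb{E}(X^n_s), \delta_{\mathbb{E}(X^n_s)}, 0) \Bigr]\,ds \le g^n(0,\delta_{\mathbb{E}(X^n_T)}) + \int_0^T f^n(s, 0, \delta_{\mathbb{E}(X^n_s)}, 0)\,ds + c \bigl( 1 + \sup_{0 \le s \le T} [\mathrm{Var}(X^n_s)] \bigr).$

By (34), we obtain

$g^n(\mathbb{E}(X^n_T),\delta_{\mathbb{E}(X^n_T)}) + \int_0^T \Bigl[ \frac{\lambda'}{2} \mathbb{E}(|\hat\alpha^n_s|^2) + f^n(s, \mathbb{E}(X^n_s), \delta_{\mathbb{E}(X^n_s)}, 0) \Bigr]\,ds \le g^n(0,\delta_{\mathbb{E}(X^n_T)}) + \int_0^T f^n(s, 0, \delta_{\mathbb{E}(X^n_s)}, 0)\,ds + c \Bigl( 1 + \Bigl( \mathbb{E}\int_0^T |\hat\alpha^n_s|^2\,ds \Bigr)^{1/2} \Bigr).$

Young's inequality and the convexity in $x$ of $g^n$ and $f^n$ from (A.2,4) give:

$\langle \mathbb{E}(X^n_T), \partial_x g^n(0,\delta_{\mathbb{E}(X^n_T)}) \rangle + \int_0^T \Bigl[ \frac{\lambda'}{4} \mathbb{E}(|\hat\alpha^n_s|^2) + \langle \mathbb{E}(X^n_s), \partial_x f^n(s, 0, \delta_{\mathbb{E}(X^n_s)}, 0) \rangle \Bigr]\,ds \le c.$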



By (A.7), we have $\mathbb{E}\int_0^T |\hat\alpha^n_s|^2\,ds \le c (1 + \sup_{0 \le s \le T} \mathbb{E}[|X^n_s|^2]^{1/2})$, and the bound (28) now follows
from (35), and as a consequence

(36)   $\mathbb{E}\bigl[\sup_{0 \le s \le T} |X^n_s|^2\bigr] \le c.$

Using (28) and (36), it is plain to prove that the processes $(X^n)_{n \ge 1}$ are tight.

Third Step. Let $\mu$ be the limit of a convergent subsequence $(\mathbb{P}_{X^{n_p}})_{p \ge 1}$. By (36), $M_{2,C([0,T];\mathbb{R}^d)}(\mu) <
+\infty$. Therefore, by Lemma 2, FBSDE (12) has a unique solution $(X_t, Y_t, Z_t)_{0 \le t \le T}$. Moreover, there
exists $u : [0,T] \times \mathbb{R}^d \to \mathbb{R}^d$, which is $c$-Lipschitz in the variable $x$ for the same constant $c$ as in the
statement of the lemma, such that $Y_t = u(t, X_t)$ for any $t \in [0,T]$. In particular,

(37)   $\sup_{0 \le t \le T} |u(t,0)| \le \sup_{0 \le t \le T} \bigl[ \mathbb{E}[|u(t,X_t) - u(t,0)|] + \mathbb{E}[|Y_t|] \bigr] < +\infty.$



We deduce that there exists a constant $c'$ such that $|u(t,x)| \le c'(1 + |x|)$, for $t \in [0,T]$ and $x \in \mathbb{R}^d$.
By (9) and (A.6), we deduce that (for a possibly new value of $c'$) $|\hat\alpha(t, x, \mu_t, u(t,x))| \le c'(1 + |x|)$.
Plugging this bound into the forward SDE satisfied by $X$ in (12), we deduce that

(38)   $\forall \ell \ge 1, \quad \mathbb{E}\bigl[\sup_{0 \le t \le T} |X_t|^\ell\bigr] < +\infty,$

and, thus,

(39)   $\mathbb{E}\int_0^T |\hat\alpha_t|^2\,dt < +\infty,$

with $\hat\alpha_t = \hat\alpha(t, X_t, \mu_t, Y_t)$, for $t \in [0,T]$. We can now apply the same argument to any $(X^n_t)_{0 \le t \le T}$,
for any $n \ge 1$. We claim

(40)   $\forall \ell \ge 1, \quad \sup_{n \ge 1} \mathbb{E}\bigl[\sup_{0 \le t \le T} |X^n_t|^\ell\bigr] < +\infty.$



Indeed, the constant $c$ in the statement of Lemma 2 does not depend on $n$. Moreover, the second-order
moments of $\sup_{0 \le t \le T} |X^n_t|$ are bounded, uniformly in $n \ge 1$, by (36). By (A.5), the driver in the backward
component in (12) is at most of linear growth in $(x, y, \alpha)$, so that by (28) and standard $L^2$ estimates
for BSDEs, the second-order moments of $\sup_{0 \le t \le T} |Y^n_t|$ are uniformly bounded as well. This
shows (40) by repeating the proof of (38). By (38) and (40), we get that $\sup_{0 \le t \le T} W_2(\mu^{n_p}_t, \mu_t) \to 0$
as $p$ tends to $+\infty$, with $\mu^{n_p} = \mathbb{P}_{X^{n_p}}$.
Repeating the proof of (27), we have

(41)   $\lambda'\,\mathbb{E}\int_0^T |\hat\alpha^n_t - \hat\alpha_t|^2\,dt \le J^n(\hat\alpha;\mu^n) - J(\hat\alpha;\mu) + J([\hat\alpha^n,\mu^n];\mu) - J^n(\hat\alpha^n;\mu^n) + \mathbb{E}\int_0^T \langle b_0(t,\mu^n_t) - b_0(t,\mu_t), Y_t \rangle\,dt,$

where $J(\cdot;\mu)$ is given by (11) and $J^n(\cdot;\mu^n)$ is defined in a similar way, but with $(f, g)$ and $(\mu_t)_{0 \le t \le T}$
replaced by $(f^n, g^n)$ and $(\mu^n_t)_{0 \le t \le T}$; $J([\hat\alpha^n,\mu^n];\mu)$ is defined as in (13). With these definitions at
hand, we notice that

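$J^n(\hat\alpha;\mu^n) - J(\hat\alpha;\mu) = \mathbb{E}[g^n(U^n_T,\mu^n_T) - g(X_T,\mu_T)] + \mathbb{E}\int_0^T [f^n(t, U^n_t, \mu^n_t, \hat\alpha_t) - f(t, X_t, \mu_t, \hat\alpha_t)]\,dt,$

where $U^n$ is the controlled diffusion process:

$dU^n_t = [b_0(t,\mu^n_t) + b_1(t) U^n_t + b_2(t) \hat\alpha_t]\,dt + \sigma\,dW_t, \quad t \in [0,T]; \quad U^n_0 = x_0.$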

By Gronwall's lemma and by convergence of $\mu^{n_p}$ towards $\mu$ for the 2-Wasserstein distance, we claim
that $U^{n_p} \to X$ as $p \to +\infty$, for the norm $\mathbb{E}[\sup_{0 \le s \le T} |\cdot_s|^2]^{1/2}$. Using on one hand the uniform
convergence of $f^n$ and $g^n$ towards $f$ and $g$ on bounded subsets of their respective domains, and on
the other hand the convergence of $\mu^{n_p}$ towards $\mu$ together with the bounds (38)-(39), we deduce that
$J^{n_p}(\hat\alpha;\mu^{n_p}) \to J(\hat\alpha;\mu)$ as $p \to +\infty$. Similarly, using the bounds (28), (38) and (40), the other differences
in the right-hand side in (41) tend to 0 along the subsequence $(n_p)_{p \ge 1}$, so that $\hat\alpha^{n_p} \to \hat\alpha$ as $p \to +\infty$
in $L^2([0,T] \times \Omega, dt \otimes d\mathbb{P})$. We deduce that $X$ is the limit of the sequence $(X^{n_p})_{p \ge 1}$ for the norm
$\mathbb{E}[\sup_{0 \le s \le T} |\cdot_s|^2]^{1/2}$. Therefore, $\mu$ matches the law of $X$ exactly, proving that equation (16) is
solvable. □

3.5. Choice of the Approximating Sequence. In order to complete the proof of Theorem 2, we must
specify the choice of the approximating sequence in Lemma 3. Actually, the choice is performed in
two steps. We first consider the case when the cost functions $f$ and $g$ are strongly convex in the
variable $x$:

Lemma 4. Assume that, in addition to (A.1-7), there exists a constant $\gamma > 0$ such that the functions
$f$ and $g$ satisfy (compare with (8)):

(42)   $f(t,x',\mu,\alpha') - f(t,x,\mu,\alpha) - \langle (x'-x, \alpha'-\alpha), \partial_{(x,\alpha)} f(t,x,\mu,\alpha) \rangle \ge \gamma |x'-x|^2 + \lambda |\alpha'-\alpha|^2,$
       $g(x',\mu) - g(x,\mu) - \langle x'-x, \partial_x g(x,\mu) \rangle \ge \gamma |x'-x|^2.$

Then, there exist two positive constants $\lambda'$ and $c'_L$, depending only upon $\lambda$, $c_L$ and $\gamma$, and two sequences
of functions $(f^n)_{n \ge 1}$ and $(g^n)_{n \ge 1}$ such that

(i) for any $n \ge 1$, $f^n$ and $g^n$ satisfy (A.1-7) with respect to the parameters $\lambda'$ and $c'_L$, and $\partial_x f^n$
and $\partial_x g^n$ are bounded,

(ii) for any bounded subset of $[0,T] \times \mathbb{R}^d \times \mathcal{P}_2(\mathbb{R}^d) \times \mathbb{R}^k$, there exists an integer $n_0$ such that,
for any $n \ge n_0$, $f^n$ and $g^n$ coincide with $f$ and $g$ respectively on this subset.

The proof of Lemma 4 is a purely technical exercise in convex analysis and, for this reason, we
postpone it to an appendix at the end of the paper.

3.6. Proof of Theorem 2. Equation (16) is solvable when, in addition to (A.1-7), $f$ and $g$ satisfy the
convexity condition (42). Indeed, by Lemma 4, there exists an approximating sequence $(f^n, g^n)_{n \ge 1}$
satisfying (i) and (ii) in the statement of Lemma 3, and also (iii) by Proposition 2. When $f$ and
$g$ satisfy (A.1-7) only, the assumptions of Lemma 3 are satisfied with the following approximating
sequence:

$f^n(t,x,\mu,\alpha) = f(t,x,\mu,\alpha) + \frac{1}{n}|x|^2; \quad g^n(x,\mu) = g(x,\mu) + \frac{1}{n}|x|^2,$

for $(t,x,\mu,\alpha) \in [0,T] \times \mathbb{R}^d \times \mathcal{P}_2(\mathbb{R}^d) \times \mathbb{R}^k$ and $n \ge 1$. Therefore, (16) is solvable under (A.1-7).
Moreover, given an arbitrary solution to (16), the existence of a function $u$, as in the statement of
Theorem 2, follows from Lemma 2 and (37). Boundedness of the moments of the forward process is
then proven as in (38). □
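As a short check of why the perturbed costs fall under Lemma 4, the display below sketches the computation, under the assumption (consistent with the phrase "compare with (8)" in Lemma 4) that condition (8) is the analogue of (42) with $\gamma = 0$.

```latex
% Elementary identity for the quadratic perturbation:
\[
|x'|^2 - |x|^2 - \langle x' - x, \, 2x \rangle = |x' - x|^2 .
\]
% Hence, if f satisfies (8) with constant \lambda (assumed form: (42) with \gamma = 0),
% then f^n = f + |x|^2 / n satisfies
\[
f^n(t,x',\mu,\alpha') - f^n(t,x,\mu,\alpha)
  - \langle (x'-x, \alpha'-\alpha), \, \partial_{(x,\alpha)} f^n(t,x,\mu,\alpha) \rangle
  \ge \frac{1}{n} |x'-x|^2 + \lambda |\alpha'-\alpha|^2 ,
\]
% and similarly
\[
g^n(x',\mu) - g^n(x,\mu) - \langle x'-x, \, \partial_x g^n(x,\mu) \rangle
  \ge \frac{1}{n} |x'-x|^2 .
\]
```

In other words, $(f^n, g^n)$ satisfies (42) with $\gamma = 1/n$, which is what allows the first part of the proof to be applied for each fixed $n$.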



4. Propagation of Chaos and Approximate Nash Equilibriums 

While the rationale for the mean-field strategy proposed by Lasry-Lions is clear given the nature 
of Nash equilibriums (as opposed to other forms of optimization suggesting the optimal control of 
stochastic dynamics of the McKean-Vlasov type as studied in [6]), it may not be obvious how the 
solution of the FBSDE introduced and solved in the previous sections provides approximate Nash 
equilibrium for large games. In this section, we prove just that. The proof relies on the fact that 
the FBSDE value function is Lipschitz continuous, standard arguments in the propagation of chaos 
theory, and the following specific result due to Horowitz et al. (see for example Section 10 in [25])
which we state as a lemma for future reference: 

Lemma 5. Given $\mu \in \mathcal{P}_{d+5}(\mathbb{R}^d)$, there exists a constant $c$ depending only upon $d$ and $M_{d+5}(\mu)$ (see
the notation (JJJ)), such that

$\mathbb{E}[W_2^2(\bar\mu^N, \mu)] \le c N^{-2/(d+4)},$

where $\bar\mu^N$ denotes the empirical measure of any sample of size $N$ from $\mu$.
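As a purely illustrative aside, the following sketch computes the $p$-Wasserstein distance between two empirical measures on the real line with the same number of atoms, for which the optimal coupling simply pairs the order statistics. The function name `empirical_wasserstein` is an assumption made for this example; the sketch does not compute the distance to the true law $\mu$ in dimension $d$ that Lemma 5 is about.

```python
def empirical_wasserstein(xs, ys, p=2):
    """W_p between two empirical measures on the real line with equally many atoms.

    In dimension one the optimal coupling matches sorted samples, so
    W_p^p = (1/N) * sum_i |x_(i) - y_(i)|^p.
    """
    if len(xs) != len(ys) or len(xs) == 0:
        raise ValueError("samples must be non-empty and of equal size")
    xs_sorted, ys_sorted = sorted(xs), sorted(ys)
    n = len(xs_sorted)
    # Average transport cost of the monotone (sorted) coupling.
    cost = sum(abs(x - y) ** p for x, y in zip(xs_sorted, ys_sorted)) / n
    return cost ** (1.0 / p)
```

For instance, `empirical_wasserstein([0.0, 1.0], [2.0, 1.0])` pairs 0 with 1 and 1 with 2, so both atoms move by one unit. Comparing two independent samples of growing size from the same law with this function gives a rough numerical feel for the decay that Lemma 5 quantifies.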
Throughout this section, assumptions (A.1-7) are in force. We let $(X_t, Y_t, Z_t)_{0 \le t \le T}$ be a solution
of (16) and $u$ be the associated FBSDE value function. We denote by $(\mu_t)_{0 \le t \le T}$ the flow of marginal
probability measures $\mu_t = \mathbb{P}_{X_t}$, for $0 \le t \le T$. We also denote by $J$ the optimal cost of the limiting
Mean-Field problem

(43)   $J = \mathbb{E}\Bigl[ g(X_T,\mu_T) + \int_0^T f(t, X_t, \mu_t, \hat\alpha(t, X_t, \mu_t, Y_t))\,dt \Bigr],$

where as before, $\hat\alpha$ is the minimizer function constructed in Lemma 1. For convenience, we fix a
sequence $((W^i_t)_{0 \le t \le T})_{i \ge 1}$ of independent $m$-dimensional Brownian motions, and for each integer
$N$, we consider the solution $(X^1_t, \dots, X^N_t)_{0 \le t \le T}$ of the system of $N$ stochastic differential equations

(44)   $dX^i_t = b(t, X^i_t, \bar\mu^N_t, \hat\alpha(t, X^i_t, \mu_t, u(t, X^i_t)))\,dt + \sigma\,dW^i_t, \quad \bar\mu^N_t = \frac{1}{N} \sum_{j=1}^N \delta_{X^j_t},$

with $t \in [0,T]$ and $X^i_0 = x_0$. Equation (44) is well posed since $u$ satisfies the regularity property (18)
and the minimizer $\hat\alpha(t, x, \mu_t, y)$ was proven, in Lemma 1, to be Lipschitz continuous and at most of
linear growth in the variables $x$ and $y$, uniformly in $t \in [0,T]$. The processes $(X^i)_{1 \le i \le N}$ give the
dynamics of the private states of the $N$ players in the stochastic differential game of interest when the
players use the strategies

(45)   $\hat\alpha^{N,i}_t = \hat\alpha(t, X^i_t, \mu_t, u(t, X^i_t)), \quad 0 \le t \le T, \quad i \in \{1, \dots, N\}.$
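To give a concrete feel for the particle system (44) driven by the distributed feedback (45), the following sketch runs an Euler-Maruyama discretization of a scalar analogue. Everything specific in it is an assumption made for illustration: the function name `simulate_particles`, the fact that the empirical measure enters only through its mean, and the user-supplied callables `feedback` (standing for $(t,x) \mapsto \hat\alpha(t, x, \mu_t, u(t,x))$) and `b0`. It is not the construction used in the proofs.

```python
import math
import random


def simulate_particles(n_players, n_steps, horizon, x0, sigma,
                       feedback, b0, b1, b2, seed=0):
    """Euler-Maruyama scheme for a scalar analogue of the N-player system (44).

    Each state follows
        dX^i = [b0(mean of states) + b1 * X^i + b2 * feedback(t, X^i)] dt + sigma dW^i,
    where the empirical measure is summarized by its mean (a simplifying assumption).
    Returns the list of terminal states.
    """
    rng = random.Random(seed)
    dt = horizon / n_steps
    states = [x0] * n_players
    for k in range(n_steps):
        t = k * dt
        mean_state = sum(states) / n_players  # statistic of the empirical measure
        states = [
            x + (b0(mean_state) + b1 * x + b2 * feedback(t, x)) * dt
            + sigma * math.sqrt(dt) * rng.gauss(0.0, 1.0)  # independent noise per player
            for x in states
        ]
    return states
```

Note that each player's control depends only on the current time and on that player's own state, which is the "distributed closed loop" feature of (45); the interaction enters solely through the drift term `b0`.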



These strategies are in closed loop form. They are even distributed since, at each time $t \in [0,T]$,
a player only needs to know his own private state in order to compute the value of the
control to apply at that time. By boundedness of $b_0$ and by (9) and (18), it holds

(46)   $\sup_{N \ge 1} \max_{1 \le i \le N} \Bigl[ \mathbb{E}\bigl[\sup_{0 \le t \le T} |X^i_t|^2\bigr] + \mathbb{E}\int_0^T |\hat\alpha^{N,i}_t|^2\,dt \Bigr] < +\infty.$



For the purpose of comparison, we introduce the notation we use when the players choose a generic
set of strategies, say $((\beta^i_t)_{0 \le t \le T})_{1 \le i \le N}$. In this case, the dynamics of the private state $U^i$ of player
$i \in \{1, \dots, N\}$ are given by:

(47)   $dU^i_t = b(t, U^i_t, \bar\nu^N_t, \beta^i_t)\,dt + \sigma\,dW^i_t, \quad \bar\nu^N_t = \frac{1}{N} \sum_{j=1}^N \delta_{U^j_t},$

with $t \in [0,T]$ and $U^i_0 = x_0$, and where $((\beta^i_t)_{0 \le t \le T})_{1 \le i \le N}$ are $N$ square-integrable $\mathbb{R}^k$-valued
processes that are progressively measurable with respect to the filtration generated by $(W^1, \dots, W^N)$.
For each $1 \le i \le N$, we denote by

(48)   $\bar J^{N,i}(\beta^1, \dots, \beta^N) = \mathbb{E}\Bigl[ g(U^i_T, \bar\nu^N_T) + \int_0^T f(t, U^i_t, \bar\nu^N_t, \beta^i_t)\,dt \Bigr]$

the cost to the $i$th player. Our goal is to construct approximate Nash equilibriums for the $N$-player
game from a solution of (16). We follow the approach used by Bensoussan et al. in the linear-quadratic
case. See also Q.

Theorem 3. Under assumptions (A.1-7), the strategies $(\hat\alpha^{N,i}_t)_{0 \le t \le T,\ 1 \le i \le N}$ defined in (45) form an
approximate Nash equilibrium of the $N$-player game (47)-(48). More precisely, there exists a constant
$c > 0$ and a sequence of positive numbers $(\epsilon_N)_{N \ge 1}$ such that, for each $N \ge 1$,

(i) $\epsilon_N \le c N^{-1/(d+4)}$;

(ii) for any player $i \in \{1, \dots, N\}$ and any progressively measurable strategy $\beta^i = (\beta^i_t)_{0 \le t \le T}$
such that $\mathbb{E}\int_0^T |\beta^i_t|^2\,dt < +\infty$, one has

(49)   $\bar J^{N,i}(\hat\alpha^{1,N}, \dots, \hat\alpha^{i-1,N}, \beta^i, \hat\alpha^{i+1,N}, \dots, \hat\alpha^{N,N}) \ge \bar J^{N,i}(\hat\alpha^{1,N}, \dots, \hat\alpha^{N,N}) - \epsilon_N.$



Proof. By symmetry (invariance under permutation) of the coefficients of the private states dynamics
and costs, we only need to prove (49) for $i = 1$. Given a progressively measurable process $\beta^1 =
(\beta^1_t)_{0 \le t \le T}$ satisfying $\mathbb{E}\int_0^T |\beta^1_t|^2\,dt < +\infty$, let us use the quantities defined in (47) and (48) with
$\beta^i_t = \hat\alpha^{N,i}_t$ for $i \in \{2, \dots, N\}$ and $t \in [0,T]$. By boundedness of $b_0$, $b_1$ and $b_2$ and by Gronwall's
inequality, we get:

(50)   $\mathbb{E}\bigl[\sup_{0 \le t \le T} |U^1_t|^2\bigr] \le c \Bigl( 1 + \mathbb{E}\int_0^T |\beta^1_t|^2\,dt \Bigr).$

Using the fact that the strategies $(\hat\alpha^{N,i}_t)_{0 \le t \le T}$ satisfy the square integrability condition of admissibility,
the same argument gives:

(51)   $\mathbb{E}\bigl[\sup_{0 \le t \le T} |U^i_t|^2\bigr] \le c,$

for $2 \le i \le N$, which clearly implies after summation:

(52)   $\frac{1}{N} \sum_{j=1}^N \mathbb{E}\bigl[\sup_{0 \le t \le T} |U^j_t|^2\bigr] \le c \Bigl( 1 + \frac{1}{N} \mathbb{E}\int_0^T |\beta^1_t|^2\,dt \Bigr).$

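For the next step of the proof we introduce the system of decoupled independent and identically
distributed states

$d\underline{X}^i_t = b(t, \underline{X}^i_t, \mu_t, \hat\alpha(t, \underline{X}^i_t, \mu_t, u(t, \underline{X}^i_t)))\,dt + \sigma\,dW^i_t, \quad 0 \le t \le T.$

Notice that the stochastic processes $\underline{X}^i$ are independent copies of $X$ and, in particular, $\mathbb{P}_{\underline{X}^i_t} = \mu_t$ for
any $t \in [0,T]$ and $i \in \{1, \dots, N\}$. We shall use the notation:

$\underline{\hat\alpha}^i_t = \hat\alpha(t, \underline{X}^i_t, \mu_t, u(t, \underline{X}^i_t)), \quad t \in [0,T], \quad i \in \{1, \dots, N\}.$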



Using the regularity of the FBSDE value function $u$ and the uniform boundedness of the family
$(M_{d+5}(\mu_t))_{0 \le t \le T}$ derived in Theorem 2, together with the estimate recalled in Lemma 5, we can
follow Sznitman's proof [26] (see also Theorem 1.3 of [16]) and get

(53)   $\max_{1 \le i \le N} \mathbb{E}\bigl[\sup_{0 \le t \le T} |X^i_t - \underline{X}^i_t|^2\bigr] \le c N^{-2/(d+4)}$

(recall that $(X^1, \dots, X^N)$ solves (44)), and this implies:

(54)   $\sup_{0 \le t \le T} \mathbb{E}[W_2^2(\bar\mu^N_t, \mu_t)] \le c N^{-2/(d+4)}.$




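Indeed, for each $t \in [0,T]$,

(55)   $W_2^2(\bar\mu^N_t, \mu_t) \le \frac{2}{N} \sum_{i=1}^N |X^i_t - \underline{X}^i_t|^2 + 2\,W_2^2\Bigl( \frac{1}{N} \sum_{i=1}^N \delta_{\underline{X}^i_t}, \mu_t \Bigr),$

so that, taking expectations on both sides and using (53) and Lemma 5, we get the desired estimate
(54). Using the local-Lipschitz regularity of the coefficients $g$ and $f$ together with the Cauchy-Schwarz
inequality, we get, for each $i \in \{1, \dots, N\}$,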



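$$\begin{aligned}
&\bigl| J - \bar J^{N,i}(\hat\alpha^{N,1}, \dots, \hat\alpha^{N,N}) \bigr| \\
&\quad = \Bigl| \mathbb{E}\Bigl[ g(\underline{X}^i_T, \mu_T) + \int_0^T f(t, \underline{X}^i_t, \mu_t, \underline{\hat\alpha}^i_t)\,dt - g(X^i_T, \bar\mu^N_T) - \int_0^T f(t, X^i_t, \bar\mu^N_t, \hat\alpha^{N,i}_t)\,dt \Bigr] \Bigr| \\
&\quad \le c\,\mathbb{E}\Bigl[ 1 + |\underline{X}^i_T|^2 + |X^i_T|^2 + \frac{1}{N} \sum_{j=1}^N |X^j_T|^2 \Bigr]^{1/2} \mathbb{E}\bigl[ |\underline{X}^i_T - X^i_T|^2 + W_2^2(\mu_T, \bar\mu^N_T) \bigr]^{1/2} \\
&\qquad + c \int_0^T \Bigl\{ \mathbb{E}\Bigl[ 1 + |\underline{X}^i_t|^2 + |X^i_t|^2 + |\underline{\hat\alpha}^i_t|^2 + |\hat\alpha^{N,i}_t|^2 + \frac{1}{N} \sum_{j=1}^N |X^j_t|^2 \Bigr]^{1/2} \\
&\qquad\qquad \times \mathbb{E}\bigl[ |\underline{X}^i_t - X^i_t|^2 + |\underline{\hat\alpha}^i_t - \hat\alpha^{N,i}_t|^2 + W_2^2(\mu_t, \bar\mu^N_t) \bigr]^{1/2} \Bigr\}\,dt,
\end{aligned}$$

for some constant $c > 0$ which can change from line to line. By (46), we deduce

$$\begin{aligned}
\bigl| J - \bar J^{N,i}(\hat\alpha^{N,1}, \dots, \hat\alpha^{N,N}) \bigr| &\le c\,\mathbb{E}\bigl[ |\underline{X}^i_T - X^i_T|^2 + W_2^2(\mu_T, \bar\mu^N_T) \bigr]^{1/2} \\
&\quad + c \Bigl( \int_0^T \mathbb{E}\bigl[ |\underline{X}^i_t - X^i_t|^2 + |\underline{\hat\alpha}^i_t - \hat\alpha^{N,i}_t|^2 + W_2^2(\mu_t, \bar\mu^N_t) \bigr]\,dt \Bigr)^{1/2}.
\end{aligned}$$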

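Now, by the Lipschitz property of the minimizer $\hat\alpha$ proven in Lemma 1 and by the Lipschitz property
of $u$ in (18), we notice that

$|\underline{\hat\alpha}^i_t - \hat\alpha^{N,i}_t| = |\hat\alpha(t, \underline{X}^i_t, \mu_t, u(t, \underline{X}^i_t)) - \hat\alpha(t, X^i_t, \mu_t, u(t, X^i_t))| \le c\,|\underline{X}^i_t - X^i_t|.$

Using (53) and (54), this proves that, for any $1 \le i \le N$,

(56)   $\bar J^{N,i}(\hat\alpha^{N,1}, \dots, \hat\alpha^{N,N}) = J + O(N^{-1/(d+4)}).$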



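This suggests that, in order to prove inequality (49) for $i = 1$, we could restrict ourselves to compare
$\bar J^{N,1}(\beta^1, \hat\alpha^{N,2}, \dots, \hat\alpha^{N,N})$ to $J$. Using the argument which led to (50), (51) and (52), together with
the definitions of $U^j$ and $X^j$ for $j = 1, \dots, N$, we get, for any $t \in [0,T]$:

$$\begin{aligned}
\mathbb{E}\bigl[\sup_{0 \le s \le t} |U^1_s - X^1_s|^2\bigr] &\le \frac{c}{N} \int_0^t \sum_{j=1}^N \mathbb{E}\bigl[\sup_{0 \le r \le s} |U^j_r - X^j_r|^2\bigr]\,ds + c\,\mathbb{E}\int_0^t |\beta^1_s - \hat\alpha^{N,1}_s|^2\,ds, \\
\mathbb{E}\bigl[\sup_{0 \le s \le t} |U^i_s - X^i_s|^2\bigr] &\le \frac{c}{N} \int_0^t \sum_{j=1}^N \mathbb{E}\bigl[\sup_{0 \le r \le s} |U^j_r - X^j_r|^2\bigr]\,ds, \quad 2 \le i \le N.
\end{aligned}$$

Therefore, using Gronwall's inequality, we get:

(57)   $\frac{1}{N} \sum_{j=1}^N \mathbb{E}\bigl[\sup_{0 \le t \le T} |U^j_t - X^j_t|^2\bigr] \le \frac{c}{N} \mathbb{E}\int_0^T |\beta^1_t - \hat\alpha^{N,1}_t|^2\,dt,$

so that

(58)   $\sup_{0 \le t \le T} \mathbb{E}[|U^i_t - X^i_t|^2] \le \frac{c}{N} \mathbb{E}\int_0^T |\beta^1_t - \hat\alpha^{N,1}_t|^2\,dt, \quad 2 \le i \le N.$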



Putting together (46), (53) and (58), we see that, for any $A > 0$, there exists a constant $c_A$ depending
on $A$ such that

(59)   $\mathbb{E}\int_0^T |\beta^1_t|^2\,dt \le A \implies \max_{2 \le i \le N} \sup_{0 \le t \le T} \mathbb{E}[|U^i_t - \underline{X}^i_t|^2] \le c_A N^{-2/(d+4)}.$

Let us fix $A > 0$ (to be determined later) and assume that $\mathbb{E}\int_0^T |\beta^1_t|^2\,dt \le A$. Using (59) we see that

(60)   $\frac{1}{N-1} \sum_{j=2}^N \mathbb{E}[|U^j_t - \underline{X}^j_t|^2] \le c_A N^{-2/(d+4)},$



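for a constant $c_A$ depending upon $A$, and whose value can change from line to line. Now by the
triangle inequality for the Wasserstein distance:

(61)   $\mathbb{E}[W_2^2(\bar\nu^N_t, \mu_t)] \le c \Bigl\{ \mathbb{E}\Bigl[ W_2^2\Bigl( \frac{1}{N} \sum_{j=1}^N \delta_{U^j_t}, \frac{1}{N-1} \sum_{j=2}^N \delta_{U^j_t} \Bigr) \Bigr] + \frac{1}{N-1} \sum_{j=2}^N \mathbb{E}[|U^j_t - \underline{X}^j_t|^2] + \mathbb{E}\Bigl[ W_2^2\Bigl( \frac{1}{N-1} \sum_{j=2}^N \delta_{\underline{X}^j_t}, \mu_t \Bigr) \Bigr] \Bigr\}.$

Noticing that

$\mathbb{E}\Bigl[ W_2^2\Bigl( \frac{1}{N} \sum_{j=1}^N \delta_{U^j_t}, \frac{1}{N-1} \sum_{j=2}^N \delta_{U^j_t} \Bigr) \Bigr] \le \frac{1}{N(N-1)} \sum_{j=2}^N \mathbb{E}[|U^1_t - U^j_t|^2],$

which is $O(N^{-1})$ because of (50) and (52). Plugging this inequality into (61), and using (60) to
control the second term and Lemma 5 to estimate the third term therein, we conclude that

(62)   $\mathbb{E}[W_2^2(\bar\nu^N_t, \mu_t)] \le c_A N^{-2/(d+4)}.$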
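The $O(N^{-1})$ bound on the distance between the two empirical measures of the $U^j_t$ can be checked with an explicit coupling. The display below is a sketch of one such coupling; the symbol $\pi$ is introduced here for illustration and is not part of the original text.

```latex
% A candidate coupling (an illustration): keep mass 1/N on each atom U^j_t, j >= 2,
% and spread the mass 1/N sitting on U^1_t evenly over the other N-1 atoms.
\[
\pi = \frac{1}{N} \sum_{j=2}^N \delta_{(U^j_t,\, U^j_t)}
    + \frac{1}{N(N-1)} \sum_{j=2}^N \delta_{(U^1_t,\, U^j_t)} .
\]
% First marginal: (1/N) \sum_{j=1}^N \delta_{U^j_t}.
% Second marginal: (1/N + 1/(N(N-1))) \sum_{j=2}^N \delta_{U^j_t}
%                = (1/(N-1)) \sum_{j=2}^N \delta_{U^j_t}.
\[
W_2^2\Bigl( \frac{1}{N} \sum_{j=1}^N \delta_{U^j_t},\,
            \frac{1}{N-1} \sum_{j=2}^N \delta_{U^j_t} \Bigr)
\le \int |x-y|^2 \, d\pi(x,y)
= \frac{1}{N(N-1)} \sum_{j=2}^N |U^1_t - U^j_t|^2 .
\]
```

Taking expectations and using $|U^1_t - U^j_t|^2 \le 2|U^1_t|^2 + 2|U^j_t|^2$ together with the second-moment bounds on the $U^j$ then gives a term of order $N^{-1}$.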

For the final step of the proof we define $(\bar U^1_t)_{0 \le t \le T}$ as the solution of the SDE

$d\bar U^1_t = b(t, \bar U^1_t, \mu_t, \beta^1_t)\,dt + \sigma\,dW^1_t, \quad 0 \le t \le T, \quad \bar U^1_0 = x_0,$

so that, from the definition (47) of $U^1$ we get:

$U^1_t - \bar U^1_t = \int_0^t [b_0(s, \bar\nu^N_s) - b_0(s, \mu_s)]\,ds + \int_0^t b_1(s) [U^1_s - \bar U^1_s]\,ds.$

Using the Lipschitz property of $b_0$, (62) and the boundedness of $b_1$ and applying Gronwall's inequality,
we get

(63)   $\sup_{0 \le t \le T} \mathbb{E}[|U^1_t - \bar U^1_t|^2] \le c_A N^{-2/(d+4)},$

so that, going over the computation leading to (56) once more and using (62), (50), (51) and (52):

$\bar J^{N,1}(\beta^1, \hat\alpha^{N,2}, \dots, \hat\alpha^{N,N}) \ge J(\beta^1) - c_A N^{-1/(d+4)},$

where $J(\beta^1)$ stands for the mean-field cost of $\beta^1$:

(64)   $J(\beta^1) = \mathbb{E}\Bigl[ g(\bar U^1_T, \mu_T) + \int_0^T f(t, \bar U^1_t, \mu_t, \beta^1_t)\,dt \Bigr].$

Since $J \le J(\beta^1)$ (notice that, even though $\beta^1$ is adapted to a larger filtration than the filtration of
$W^1$, the stochastic maximum principle still applies as pointed out in Remark 1), we get in the end

(65)   $\bar J^{N,1}(\beta^1, \hat\alpha^{N,2}, \dots, \hat\alpha^{N,N}) \ge J - c_A N^{-1/(d+4)},$

and from (56) and (65), we easily derive the desired inequality (49). Actually, the combination of (56)
and (65) shows that $(\hat\alpha^{N,1}, \dots, \hat\alpha^{N,N})$ is an $\epsilon$-Nash equilibrium for $N$ large enough, with a precise
quantification (though not optimal) of the relationship between $N$ and $\epsilon$. But for the proof to be
complete in full generality, we need to explain how we choose $A$, and discuss what happens when
$\mathbb{E}\int_0^T |\beta^1_t|^2\,dt > A$.

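Using the convexity in $x$ of $g$ around $x = 0$ and the convexity of $f$ in $(x, \alpha)$ around $x = 0$ and
$\alpha = 0$, see (8), we get:

$$\begin{aligned}
\bar J^{N,1}(\beta^1, \hat\alpha^{N,2}, \dots, \hat\alpha^{N,N}) &\ge \mathbb{E}\Bigl[ g(0, \bar\nu^N_T) + \int_0^T f(t, 0, \bar\nu^N_t, 0)\,dt \Bigr] + \lambda\,\mathbb{E}\int_0^T |\beta^1_t|^2\,dt \\
&\quad + \mathbb{E}\Bigl[ \langle U^1_T, \partial_x g(0, \bar\nu^N_T) \rangle + \int_0^T \bigl( \langle U^1_t, \partial_x f(t, 0, \bar\nu^N_t, 0) \rangle + \langle \beta^1_t, \partial_\alpha f(t, 0, \bar\nu^N_t, 0) \rangle \bigr)\,dt \Bigr].
\end{aligned}$$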



The local-Lipschitz assumption with respect to the Wasserstein distance and the definition of the latter
imply the existence of a constant $c > 0$ such that for any $t \in [0,T]$,

$\mathbb{E}[|f(t, 0, \bar\nu^N_t, 0) - f(t, 0, \delta_0, 0)|] \le c\,\mathbb{E}[1 + M_2^2(\bar\nu^N_t)] = c \Bigl[ 1 + \frac{1}{N} \sum_{i=1}^N \mathbb{E}[|U^i_t|^2] \Bigr],$

with a similar inequality for $g$. From this, we deduce



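$$\begin{aligned}
\bar J^{N,1}(\beta^1, \hat\alpha^{N,2}, \dots, \hat\alpha^{N,N}) &\ge g(0, \delta_0) + \int_0^T f(t, 0, \delta_0, 0)\,dt \\
&\quad + \mathbb{E}\Bigl[ \langle U^1_T, \partial_x g(0, \bar\nu^N_T) \rangle + \int_0^T \bigl( \langle U^1_t, \partial_x f(t, 0, \bar\nu^N_t, 0) \rangle + \langle \beta^1_t, \partial_\alpha f(t, 0, \bar\nu^N_t, 0) \rangle \bigr)\,dt \Bigr] \\
&\quad + \lambda\,\mathbb{E}\int_0^T |\beta^1_t|^2\,dt - c \Bigl[ 1 + \frac{1}{N} \sum_{i=1}^N \sup_{0 \le t \le T} \mathbb{E}[|U^i_t|^2] \Bigr].
\end{aligned}$$

By (A.5), we know that $\partial_x g$, $\partial_x f$ and $\partial_\alpha f$ are at most of linear growth in the measure parameter (for
the $L^2$-norm), so that, for any $\delta > 0$, there exists a constant $c_\delta$ such that

(66)   $\bar J^{N,1}(\beta^1, \hat\alpha^{N,2}, \dots, \hat\alpha^{N,N}) \ge g(0, \delta_0) + \int_0^T f(t, 0, \delta_0, 0)\,dt + \frac{\lambda}{2} \mathbb{E}\int_0^T |\beta^1_t|^2\,dt - \delta \sup_{0 \le t \le T} \mathbb{E}[|U^1_t|^2] - c_\delta \Bigl( 1 + \frac{1}{N} \sum_{i=1}^N \sup_{0 \le t \le T} \mathbb{E}[|U^i_t|^2] \Bigr).$

Estimates (50) and (51) show that one can choose $\delta$ small enough in (66) and $c$ so that

$\bar J^{N,1}(\beta^1, \hat\alpha^{N,2}, \dots, \hat\alpha^{N,N}) \ge -c + \Bigl( \frac{\lambda}{4} - \frac{c}{N} \Bigr) \mathbb{E}\int_0^T |\beta^1_t|^2\,dt.$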

This proves that there exists an integer $N_0$ such that, for any integer $N \ge N_0$ and constant $\bar A > 0$,
one can choose $A > 0$ such that

(67)   $\mathbb{E}\int_0^T |\beta^1_t|^2\,dt \ge A \implies \bar J^{N,1}(\beta^1, \hat\alpha^{N,2}, \dots, \hat\alpha^{N,N}) \ge J + \bar A,$

which provides us with the appropriate tool to choose $A$ and avoid having to consider $(\beta^1_t)_{0 \le t \le T}$
whose expected square integral is too large. □



A simple inspection of the last part of the above proof shows that a stronger result actually holds
when $\mathbb{E}\int_0^T |\beta^1_t|^2\,dt \le A$. Indeed, the estimates (50), (59) and (62) can be used as in (56) to deduce
(up to a modification of $c_A$)

(68)   $\bar J^{N,i}(\beta^1, \hat\alpha^{N,2}, \dots, \hat\alpha^{N,N}) \ge J - c_A N^{-1/(d+4)}, \quad 2 \le i \le N.$

Corollary 1. Under assumptions (A.1-7), not only does

$\bigl( \hat\alpha^{N,i}_t = \hat\alpha(t, X^i_t, \mu_t, u(t, X^i_t)) \bigr)_{1 \le i \le N,\ 0 \le t \le T}$

form an approximate Nash equilibrium of the $N$-player game (47)-(48) but:

(i) there exists an integer $N_0$ such that, for any $N \ge N_0$ and $\bar A > 0$, there exists a constant $A > 0$
such that, for any player $i \in \{1, \dots, N\}$ and any admissible strategy $\beta^i = (\beta^i_t)_{0 \le t \le T}$,

(69)   $\mathbb{E}\int_0^T |\beta^i_t|^2\,dt \ge A \implies \bar J^{N,i}(\hat\alpha^{1,N}, \dots, \hat\alpha^{i-1,N}, \beta^i, \hat\alpha^{i+1,N}, \dots, \hat\alpha^{N,N}) \ge J + \bar A.$

(ii) Moreover, for any $A > 0$, there exists a sequence of positive real numbers $(\epsilon_N)_{N \ge 1}$ converging
toward 0, such that for any admissible strategy $\beta^1 = (\beta^1_t)_{0 \le t \le T}$ for the first player

(70)   $\mathbb{E}\int_0^T |\beta^1_t|^2\,dt \le A \implies \min_{1 \le i \le N} \bar J^{N,i}(\beta^1, \hat\alpha^{2,N}, \dots, \hat\alpha^{N,N}) \ge J - \epsilon_N.$
5. Appendix: Proof of Lemma 4

We focus on the approximation of the running cost $f$ (the case of the terminal cost $g$ is similar) and
we ignore the dependence of $f$ upon $t$ to simplify the notation. For any $n \ge 1$, we define $f_n$ as the
truncated Legendre transform:

(71)   $f_n(x,\mu,\alpha) = \sup_{|y| \le n} \inf_{z \in \mathbb{R}^d} [\langle y, x - z \rangle + f(z,\mu,\alpha)],$

for $(x,\alpha) \in \mathbb{R}^d \times \mathbb{R}^k$ and $\mu \in \mathcal{P}_2(\mathbb{R}^d)$. By standard properties of the Legendre transform of convex
functions,

(72)   $f_n(x,\mu,\alpha) \le \sup_{y \in \mathbb{R}^d} \inf_{z \in \mathbb{R}^d} [\langle y, x - z \rangle + f(z,\mu,\alpha)] = f(x,\mu,\alpha).$

Moreover, by strict convexity of $f$ in $x$,

(73)   $f_n(x,\mu,\alpha) \ge \inf_{z \in \mathbb{R}^d} [f(z,\mu,\alpha)] \ge \inf_{z \in \mathbb{R}^d} [\gamma |z|^2 + \langle \partial_x f(0,\mu,\alpha), z \rangle] + f(0,\mu,\alpha) \ge -\frac{1}{4\gamma} |\partial_x f(0,\mu,\alpha)|^2 + f(0,\mu,\alpha),$

so that $f_n$ has finite real values. Clearly, it is also $n$-Lipschitz continuous in $x$.
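To illustrate what the truncation in (71) does, the following sketch evaluates it for the scalar model cost $f(z) = \gamma z^2$, which is a hypothetical instance chosen for this example and not a case singled out in the paper. For this cost the inner infimum equals $yx - y^2/(4\gamma)$, so the truncated transform coincides with $f$ when $2\gamma|x| \le n$ and grows linearly with slope $n$ beyond that. The function names are assumptions of the sketch.

```python
def truncated_legendre_quadratic(x, n, gamma=1.0):
    """Closed form of (71) for the scalar cost f(z) = gamma * z**2.

    The inner infimum over z equals y*x - y**2 / (4*gamma); the outer supremum
    over |y| <= n is attained at y = 2*gamma*x clipped to [-n, n].
    """
    y_star = max(-n, min(n, 2.0 * gamma * x))
    return y_star * x - y_star ** 2 / (4.0 * gamma)


def truncated_legendre_grid(x, n, gamma=1.0, grid_size=4001):
    """Brute-force check: maximise the inner value over a uniform grid of y in [-n, n]."""
    best = float("-inf")
    for k in range(grid_size):
        y = -n + 2.0 * n * k / (grid_size - 1)
        best = max(best, y * x - y ** 2 / (4.0 * gamma))
    return best
```

With `n = 4` and `gamma = 1`, the point `x = 1.0` lies in the region where the truncation is inactive, while `x = 3.0` lies in the linear regime; this matches the two features used in the appendix, namely that $f_n$ agrees with $f$ on bounded sets for $n$ large and is $n$-Lipschitz in $x$.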

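First Step. We first check that the sequence $(f_n)_{n \ge 1}$ converges towards $f$, uniformly on bounded
subsets of $\mathbb{R}^d \times \mathcal{P}_2(\mathbb{R}^d) \times \mathbb{R}^k$. So for any given $R > 0$, we restrict ourselves to $|x| \le R$ and $|\alpha| \le R$,
and $\mu \in \mathcal{P}_2(\mathbb{R}^d)$ such that $M_2(\mu) \le R$. By (A.5), there exists a constant $c > 0$, independent of $R$,
such that

$$\sup_{z \in \mathbb{R}^d} \bigl[ \langle y, z \rangle - f(z, \mu, \alpha) \bigr] \ge \sup_{z \in \mathbb{R}^d} \bigl[ \langle y, z \rangle - c|z|^2 \bigr] - c(1 + R^2) = \frac{|y|^2}{4c} - c(1 + R^2). \tag{74}$$

Therefore,

$$\inf_{z \in \mathbb{R}^d} \bigl[ \langle y, x - z \rangle + f(z, \mu, \alpha) \bigr] \le R|y| - \frac{|y|^2}{4c} + c(1 + R^2). \tag{75}$$

By (73) and (A.5), $f_n(x, \mu, \alpha) \ge -c(1 + R^2)$, $c$ depending possibly on $\gamma$, so that optimization in
the variable $y$ can be done over points $y^*$ satisfying

$$-c(1 + R^2) \le R|y^*| - \frac{|y^*|^2}{4c} + c(1 + R^2), \quad \text{that is} \quad |y^*| \le c(1 + R). \tag{76}$$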
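The two computations behind (74) and (76) are elementary; a short worked version is given below as a sketch. It assumes that (76) reads $-c(1+R^2) \le R|y^*| - |y^*|^2/(4c) + c(1+R^2)$, and it tracks an explicit constant $6c$ that the text simply absorbs into a new value of $c$. The shorthand $t = |y^*|$ is introduced only for this illustration.

```latex
% Step 1: Legendre transform of z -> c|z|^2, used in (74); the maximizer is z = y/(2c).
\[
\sup_{z \in \mathbb{R}^d} \bigl[ \langle y, z \rangle - c|z|^2 \bigr]
  = \sup_{z \in \mathbb{R}^d} \Bigl[ \frac{|y|^2}{4c} - c \Bigl| z - \frac{y}{2c} \Bigr|^2 \Bigr]
  = \frac{|y|^2}{4c}.
\]
% Step 2: solving the quadratic inequality in (76), with t := |y^*| (illustrative shorthand).
\begin{align*}
-c(1+R^2) \le R t - \frac{t^2}{4c} + c(1+R^2)
  &\iff t^2 - 4cR\,t - 8c^2(1+R^2) \le 0 \\
  &\implies t \le 2cR + 2c\sqrt{3R^2 + 2} \\
  &\implies t \le 2cR + 4c(1+R) \le 6c\,(1+R),
\end{align*}
% where the last line uses \sqrt{3R^2+2} \le \sqrt{3}\,R + \sqrt{2} \le 2(1+R).
```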

In particular, for $n$ large enough (depending on $R$),

$$f_n(x, \mu, \alpha) = \sup_{y \in \mathbb{R}^d} \inf_{z \in \mathbb{R}^d} \bigl[ \langle y, x - z \rangle + f(z, \mu, \alpha) \bigr] = f(x, \mu, \alpha).$$



So on bounded subsets of $\mathbb{R}^d \times \mathcal{P}_2(\mathbb{R}^d) \times \mathbb{R}^k$, $f_n$ and $f$ coincide for $n$ large enough. In particular,
for $n$ large enough, $f_n(0, \delta_0, 0)$, $\partial_x f_n(0, \delta_0, 0)$ and $\partial_\alpha f_n(0, \delta_0, 0)$ exist, coincide with $f(0, \delta_0, 0)$,
$\partial_x f(0, \delta_0, 0)$ and $\partial_\alpha f(0, \delta_0, 0)$ respectively, and are bounded by $c_L$ as in (A.5). Moreover, still for
$|x| \le R$, $|\alpha| \le R$ and $M_2(\mu) \le R$, we see from (72) and (76) that optimization in $z$ can be reduced
to $z^*$ satisfying

$$\langle y^*, x - z^* \rangle + f(z^*, \mu, \alpha) \le f(x, \mu, \alpha) \le c(1 + R^2),$$

the second inequality following from (A.5). By strict convexity of $f$ in $x$, we obtain

$$-c(1 + R)|z^*| + \gamma |z^*|^2 + \langle \partial_x f(0, \mu, \alpha), z^* \rangle + f(0, \mu, \alpha) \le c(1 + R^2),$$

so that, by (A.5), $\gamma |z^*|^2 - c(1 + R)|z^*| \le c(1 + R^2)$, that is

$$|z^*| \le c(1 + R). \tag{77}$$
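To make the first step concrete, here is a worked example that is not part of the original proof. It assumes the simplest admissible cost, $f(z, \mu, \alpha) = \gamma |z|^2$ with no dependence on $\mu$ or $\alpha$, and computes the truncated Legendre transform (71) in closed form. The result is a Huber-type function: it agrees with $f$ on the ball $|x| \le n/(2\gamma)$ and grows linearly with slope $n$ outside, which illustrates both the $n$-Lipschitz property and the coincidence of $f_n$ and $f$ on bounded sets for $n$ large.

```latex
% Illustrative choice (an assumption made for this example): f(z) = \gamma |z|^2.
% Inner infimum: \inf_z [ <y, x - z> + \gamma|z|^2 ] = <y, x> - |y|^2/(4\gamma).
\[
f_n(x) = \sup_{|y| \le n} \Bigl[ \langle y, x \rangle - \frac{|y|^2}{4\gamma} \Bigr]
       = \sup_{0 \le r \le n} \Bigl[ r|x| - \frac{r^2}{4\gamma} \Bigr].
\]
% The unconstrained maximizer is r = 2\gamma|x|; it is admissible iff 2\gamma|x| \le n.
\[
f_n(x) =
\begin{cases}
\gamma |x|^2, & |x| \le \dfrac{n}{2\gamma}, \\[2mm]
n|x| - \dfrac{n^2}{4\gamma}, & |x| > \dfrac{n}{2\gamma}.
\end{cases}
\]
% Hence f_n = f on \{ |x| \le R \} as soon as n \ge 2\gamma R, and |\nabla f_n| \le n everywhere.
```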

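Second Step. We now investigate the convexity property of $f_n(\cdot, \mu, \cdot)$, for a given $\mu \in \mathcal{P}_2(\mathbb{R}^d)$. For
any $h \in \mathbb{R}$, $x, e, y, z_1, z_2 \in \mathbb{R}^d$ and $\alpha, \beta \in \mathbb{R}^k$, with $|y| \le n$ and $|e|, |\beta| \le 1$, we deduce from the
convexity of $f(\cdot, \mu, \cdot)$:

$$\begin{aligned}
2 \inf_{z \in \mathbb{R}^d} \bigl[ \langle y, x - z \rangle + f(z, \mu, \alpha) \bigr]
&\le \bigl\langle y, (x + he - z_1) + (x - he - z_2) \bigr\rangle + 2 f\Bigl( \frac{z_1 + z_2}{2}, \mu, \frac{(\alpha + h\beta) + (\alpha - h\beta)}{2} \Bigr) \\
&\le \langle y, x + he - z_1 \rangle + f(z_1, \mu, \alpha + h\beta) + \langle y, x - he - z_2 \rangle + f(z_2, \mu, \alpha - h\beta) - 2\lambda h^2.
\end{aligned}$$

Taking infimum with respect to $z_1, z_2$ and supremum with respect to $y$, we obtain

$$f_n(x, \mu, \alpha) \le \frac{1}{2} f_n(x + he, \mu, \alpha + h\beta) + \frac{1}{2} f_n(x - he, \mu, \alpha - h\beta) - \lambda h^2. \tag{78}$$

In particular, the function $\mathbb{R}^d \times \mathbb{R}^k \ni (x, \alpha) \mapsto f_n(x, \mu, \alpha) - \lambda |\alpha|^2$ is convex. We prove later on that
it is also continuously differentiable so that (8) holds.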
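The passage from (78) to convexity can be spelled out as a second-difference computation. The sketch below is an illustration rather than the authors' wording: the name $g$ is introduced here only for convenience, and the final step relies on the standard fact that a midpoint-convex function which is locally bounded (here by (72) and (73)) is convex.

```latex
% Illustration only: g(x,\alpha) := f_n(x,\mu,\alpha) - \lambda|\alpha|^2 (name chosen for this sketch).
\begin{align*}
& g(x + he, \alpha + h\beta) + g(x - he, \alpha - h\beta) - 2 g(x, \alpha) \\
&\quad = \bigl[ f_n(x + he, \mu, \alpha + h\beta) + f_n(x - he, \mu, \alpha - h\beta) - 2 f_n(x, \mu, \alpha) \bigr]
      - \lambda \bigl[ |\alpha + h\beta|^2 + |\alpha - h\beta|^2 - 2|\alpha|^2 \bigr] \\
&\quad \ge 2\lambda h^2 - 2\lambda h^2 |\beta|^2
   \;=\; 2\lambda h^2 \bigl( 1 - |\beta|^2 \bigr) \;\ge\; 0 ,
\end{align*}
% using (78) multiplied by 2 for the first bracket, the parallelogram identity
% |\alpha + h\beta|^2 + |\alpha - h\beta|^2 = 2|\alpha|^2 + 2h^2|\beta|^2 for the second,
% and |\beta| \le 1. Every increment (u,v) in R^d x R^k can be written as h(e,\beta)
% with |e|,|\beta| \le 1 (take h = \max(|u|,|v|)), so g is midpoint convex in all directions.
```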

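In a similar way, we can investigate the semi-concavity property of $f_n(\cdot, \mu, \cdot)$. For any $h \in \mathbb{R}$,
$x, e, y_1, y_2 \in \mathbb{R}^d$, $\alpha, \beta \in \mathbb{R}^k$, with $|y_1|, |y_2| \le n$ and $|e|, |\beta| \le 1$,

$$\begin{aligned}
&\inf_{z \in \mathbb{R}^d} \bigl[ \langle y_1, x + he - z \rangle + f(z, \mu, \alpha + h\beta) \bigr] + \inf_{z \in \mathbb{R}^d} \bigl[ \langle y_2, x - he - z \rangle + f(z, \mu, \alpha - h\beta) \bigr] \\
&\quad = \inf_{z \in \mathbb{R}^d} \bigl[ \langle y_1, x - z \rangle + f(z + he, \mu, \alpha + h\beta) \bigr] + \inf_{z \in \mathbb{R}^d} \bigl[ \langle y_2, x - z \rangle + f(z - he, \mu, \alpha - h\beta) \bigr].
\end{aligned}$$

By expanding $f(\cdot, \mu, \cdot)$ up to the second order, we see that

$$\begin{aligned}
&\inf_{z \in \mathbb{R}^d} \bigl[ \langle y_1, x + he - z \rangle + f(z, \mu, \alpha + h\beta) \bigr] + \inf_{z \in \mathbb{R}^d} \bigl[ \langle y_2, x - he - z \rangle + f(z, \mu, \alpha - h\beta) \bigr] \\
&\quad \le \inf_{z \in \mathbb{R}^d} \bigl[ \langle y_1 + y_2, x - z \rangle + 2 f(z, \mu, \alpha) \bigr] + c|h|^2,
\end{aligned}$$

for some constant $c$. Taking the supremum over $y_1, y_2$, we deduce that

$$f_n(x + he, \mu, \alpha + h\beta) + f_n(x - he, \mu, \alpha - h\beta) - 2 f_n(x, \mu, \alpha) \le c|h|^2.$$

So for any $\mu \in \mathcal{P}_2(\mathbb{R}^d)$, the function $\mathbb{R}^d \times \mathbb{R}^k \ni (x, \alpha) \mapsto f_n(x, \mu, \alpha) - c[|x|^2 + |\alpha|^2]$ is concave and
$f_n(\cdot, \mu, \cdot)$ is $C^{1,1}$, the Lipschitz constant of the derivatives being uniform in $n \ge 1$ and $\mu \in \mathcal{P}_2(\mathbb{R}^d)$.
Moreover, by definition, the function $f_n(\cdot, \mu, \cdot)$ is $n$-Lipschitz continuous in the variable $x$, that is
$\partial_x f_n$ is bounded, as required.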
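The "expansion up to the second order" used in the semi-concavity estimate can be made explicit as follows. This is a sketch under the assumption that $F = f(\cdot, \mu, \cdot)$ has an $L$-Lipschitz gradient in $(x, \alpha)$, with $L$ independent of $\mu$; the symbols $F$, $L$, $u$ and $v$ are introduced here only for illustration.

```latex
% Illustration only: F := f(\cdot,\mu,\cdot), u := (z,\alpha), v := (he, h\beta),
% L := a Lipschitz constant of \nabla F (assumed to exist by the C^{1,1} hypothesis).
\begin{align*}
F(u + v) + F(u - v) - 2F(u)
  &= \int_0^1 \bigl\langle \nabla F(u + tv) - \nabla F(u - tv), \, v \bigr\rangle \, dt \\
  &\le \int_0^1 2tL \, |v|^2 \, dt \;=\; L |v|^2
   \;=\; L h^2 \bigl( |e|^2 + |\beta|^2 \bigr) \;\le\; 2L h^2 ,
\end{align*}
% since |e|, |\beta| \le 1. This gives the constant c = 2L in the bound c|h|^2.
```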

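Third Step. We now investigate (A.5). Given $\delta > 0$, $R > 0$ and $n \ge 1$, we consider $x \in \mathbb{R}^d$,
$\alpha \in \mathbb{R}^k$ and $\mu, \mu' \in \mathcal{P}_2(\mathbb{R}^d)$ such that

$$\max\bigl( |x|, |\alpha|, M_2(\mu), M_2(\mu') \bigr) \le R, \qquad W_2(\mu, \mu') \le \delta. \tag{79}$$

By (A.5) and (77), we can find a constant $c'$ (possibly depending on $\gamma$) such that

$$\begin{aligned}
f_n(x, \mu', \alpha) &= \sup_{|y| \le n} \inf_{|z| \le c(1 + R)} \bigl[ \langle y, x - z \rangle + f(z, \mu', \alpha) \bigr] \\
&\le \sup_{|y| \le n} \inf_{|z| \le c(1 + R)} \bigl[ \langle y, x - z \rangle + f(z, \mu, \alpha) + c_L (1 + R + |z|) \delta \bigr] \\
&\le \sup_{|y| \le n} \inf_{|z| \le c(1 + R)} \bigl[ \langle y, x - z \rangle + f(z, \mu, \alpha) \bigr] + c'(1 + R) \delta.
\end{aligned} \tag{80}$$

By (77), the last supremum equals $f_n(x, \mu, \alpha)$. This proves local Lipschitz-continuity in the measure argument as in (A.5).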

In order to prove local Lipschitz-continuity in the variables $x$ and $\alpha$, we use the $C^{1,1}$-property.
Indeed, for $x$, $\mu$ and $\alpha$ as in (79), we know that

$$|\partial_x f_n(x, \mu, \alpha)| + |\partial_\alpha f_n(x, \mu, \alpha)| \le |\partial_x f_n(0, \mu, 0)| + |\partial_\alpha f_n(0, \mu, 0)| + cR. \tag{81}$$

By (72), for any integer $p \ge 1$, there exists an integer $n_p$, such that, for any $n \ge n_p$, $f_n(0, \mu, 0)$ and
$f(0, \mu, 0)$ coincide for $M_2(\mu) \le p$. In particular, for $n \ge n_p$,

$$|\partial_x f_n(0, \mu, 0)| + |\partial_\alpha f_n(0, \mu, 0)| \le c(1 + M_2(\mu)) \quad \text{whenever } M_2(\mu) \le p, \tag{82}$$

so that (81) implies (A.5) whenever $n \ge n_p$ and $M_2(\mu) \le p$. We get rid of these restrictions by
modifying the definition of $f_n$. Given a probability measure $\mu \in \mathcal{P}_2(\mathbb{R}^d)$ and an integer $p \ge 1$, we
define $\Phi_p(\mu)$ as the push-forward of $\mu$ by the mapping $\mathbb{R}^d \ni x \mapsto [\max(M_2(\mu), p)]^{-1} p x$, so that
$\Phi_p(\mu) \in \mathcal{P}_2(\mathbb{R}^d)$ and $M_2(\Phi_p(\mu)) \le \min(p, M_2(\mu))$. Indeed, if $X$ has $\mu$ as distribution, then the
r.v. $X_p = pX / \max(M_2(\mu), p)$ has $\Phi_p(\mu)$ as distribution. It is easy to check that $\Phi_p$ is Lipschitz
continuous for the 2-Wasserstein distance, uniformly in $p \ge 1$. We then consider the approximating
sequence

$$\hat{f}_p : \mathbb{R}^d \times \mathcal{P}_2(\mathbb{R}^d) \times \mathbb{R}^k \ni (x, \mu, \alpha) \mapsto f_{n_p}(x, \Phi_p(\mu), \alpha), \quad p \ge 1,$$

instead of $(f_n)_{n \ge 1}$ itself. Clearly, on any bounded subset, $\hat{f}_p$ still coincides with $f$ for $p$ large enough.
Moreover, the conclusion of the second step is preserved. In particular, the conclusion of the second
step together with (80), (81) and (82) say that (A.5) holds (for a possible new choice of $c_L$). From
now on, we get rid of the symbol "hat" in $(\hat{f}_p)_{p \ge 1}$ and keep the notation $(f_n)_{n \ge 1}$ for $(\hat{f}_p)_{p \ge 1}$.
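The moment bound claimed for the rescaled measure $\Phi_p(\mu)$ can be checked directly. The computation below is a sketch that assumes $M_2(\mu)$ denotes the square root of the second moment, $M_2(\mu) = (\int |x|^2 \, d\mu(x))^{1/2}$, which is the usual convention but is not restated in this section.

```latex
% Assumption for this sketch: M_2(\mu) = ( \int |x|^2 d\mu(x) )^{1/2}.
% With X ~ \mu and X_p = pX / \max(M_2(\mu), p) ~ \Phi_p(\mu):
\[
M_2\bigl( \Phi_p(\mu) \bigr)
  = \bigl( \mathbb{E} |X_p|^2 \bigr)^{1/2}
  = \frac{p \, M_2(\mu)}{\max( M_2(\mu), p )}
  =
  \begin{cases}
    M_2(\mu), & M_2(\mu) \le p, \\
    p,        & M_2(\mu) > p,
  \end{cases}
  \;=\; \min\bigl( p, M_2(\mu) \bigr).
\]
% In particular \Phi_p(\mu) = \mu whenever M_2(\mu) \le p, so the modified function
% agrees with f_{n_p} (and hence with f for p large) on bounded sets of measures.
```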

Fourth Step. It only remains to check that $f_n$ satisfies the bound (A.6) and the sign condition (A.7).
Since $|\partial_\alpha f(x, \mu, 0)| \le c_L$, the Lipschitz property of $\partial_\alpha f$ implies that there exists a constant $c > 0$
such that $|\partial_\alpha f(x, \mu, \alpha)| \le c$ for all $(x, \mu, \alpha) \in \mathbb{R}^d \times \mathcal{P}_2(\mathbb{R}^d) \times \mathbb{R}^k$ with $|\alpha| \le 1$. In particular, for any
$n \ge 1$, it is plain to see that $f_n(x, \mu, \alpha) \le f_n(x, \mu, 0) + c|\alpha|$, for any $(x, \mu, \alpha) \in \mathbb{R}^d \times \mathcal{P}_2(\mathbb{R}^d) \times \mathbb{R}^k$
with $|\alpha| \le 1$, so that $|\partial_\alpha f_n(x, \mu, 0)| \le c$. This proves (A.6).

Finally, we can modify the definition of $f_n$ once more to satisfy (A.7). Indeed, for any $R > 0$, there
exists an integer $n_R$, such that, for any $n \ge n_R$, $f_n(x, \mu, \alpha)$ and $f(x, \mu, \alpha)$ coincide for $(x, \mu, \alpha) \in
\mathbb{R}^d \times \mathcal{P}_2(\mathbb{R}^d) \times \mathbb{R}^k$ with $|x|, |\alpha|, M_2(\mu) \le R$, so that $\langle x, \partial_x f_n(0, \delta_x, 0) \rangle \ge -c_L(1 + |x|)$, for $|x| \le R$
and $n \ge n_R$. Next we choose a smooth function $\psi : \mathbb{R}^d \to \mathbb{R}^d$, satisfying $|\psi(x)| \le 1$ for any $x \in \mathbb{R}^d$,
$\psi(x) = x$ for $|x| \le 1/2$ and $\psi(x) = x/|x|$ for $|x| \ge 1$, and we set $\hat{f}_p(x, \mu, \alpha) = f_{n_p}(x, \Psi_p(\mu), \alpha)$
for any integer $p \ge 1$ and $(x, \mu, \alpha) \in \mathbb{R}^d \times \mathcal{P}_2(\mathbb{R}^d) \times \mathbb{R}^k$, where $\Psi_p(\mu)$ is the push-forward of $\mu$ by
the mapping $\mathbb{R}^d \ni x \mapsto x - \bar{\mu} + p \psi(p^{-1} \bar{\mu})$. Recall that $\bar{\mu}$ stands for the mean of $\mu$. In other words,
if $X$ has distribution $\mu$, then $X_p = X - \mathbb{E}(X) + p \psi(p^{-1} \mathbb{E}(X))$ has distribution $\Psi_p(\mu)$.
The mapping $\Psi_p$ is Lipschitz continuous with respect to $W_2$, uniformly in $p \ge 1$. Moreover, for any $R > 0$
and $p \ge 2R$, $M_2(\mu) \le R$ implies $|\int_{\mathbb{R}^d} x' \, d\mu(x')| \le R$ so that $|p^{-1} \int_{\mathbb{R}^d} x' \, d\mu(x')| \le 1/2$, that is
$\Psi_p(\mu) = \mu$ and, for $|x|, |\alpha| \le R$, $\hat{f}_p(x, \mu, \alpha) = f_{n_p}(x, \mu, \alpha) = f(x, \mu, \alpha)$. Therefore, the sequence
$(\hat{f}_p)_{p \ge 1}$ is an approximating sequence for $f$ which satisfies the same regularity properties as $(f_n)_{n \ge 1}$.
In addition,

$$\langle x, \partial_x \hat{f}_p(0, \delta_x, 0) \rangle = \langle x, \partial_x f_{n_p}(0, \delta_{p\psi(p^{-1}x)}, 0) \rangle = \langle x, \partial_x f(0, \delta_{p\psi(p^{-1}x)}, 0) \rangle$$

for $x \in \mathbb{R}^d$. Finally we choose $\psi(x) = [\rho(|x|)/|x|] x$ (with $\psi(0) = 0$), where $\rho$ is a smooth non-
decreasing function from $[0, +\infty)$ into $[0, 1]$ such that $\rho(x) = x$ on $[0, 1/2]$ and $\rho(x) = 1$ on $[1, +\infty)$.
If $x \neq 0$, then the above right-hand side is equal to

$$\begin{aligned}
\langle x, \partial_x f(0, \delta_{p\psi(p^{-1}x)}, 0) \rangle
&= \frac{|x|}{p \rho(p^{-1}|x|)} \bigl\langle p\psi(p^{-1}x), \partial_x f(0, \delta_{p\psi(p^{-1}x)}, 0) \bigr\rangle \\
&\ge -c_L \frac{|x|}{p \rho(p^{-1}|x|)} \bigl( 1 + |p\psi(p^{-1}x)| \bigr).
\end{aligned}$$

For $|x| \le p/2$, we have $\rho(p^{-1}|x|) = p^{-1}|x|$, so that the right-hand side coincides with $-c_L(1 + |x|)$.
For $|x| > p/2$, we have $\rho(p^{-1}|x|) \ge 1/2$ so that

$$-c_L \frac{|x|}{p \rho(p^{-1}|x|)} \bigl( 1 + |p\psi(p^{-1}x)| \bigr) \ge -2 c_L p^{-1} |x| (1 + p) \ge -4 c_L |x|.$$

This proves that (A.7) holds with a new constant. □
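For readers who want to verify the final estimate, the identities used for the radial cut-off can be written out as below. This is an illustrative sketch, not part of the original proof; it assumes $\psi(x) = [\rho(|x|)/|x|] x$ with $\rho$ as described in the fourth step, and it uses that $\rho(s) > 0$ for $s > 0$ because $\rho$ is non-decreasing and equals the identity on $[0, 1/2]$.

```latex
% For x \neq 0 and an integer p \ge 1:
\begin{align*}
p \, \psi(p^{-1} x)
  &= p \, \frac{\rho(p^{-1}|x|)}{p^{-1}|x|} \, p^{-1} x
   = p \, \rho(p^{-1}|x|) \, \frac{x}{|x|},
  &
  \bigl| p \, \psi(p^{-1} x) \bigr| &= p \, \rho(p^{-1}|x|) \le p, \\
x &= \frac{|x|}{p \, \rho(p^{-1}|x|)} \; p \, \psi(p^{-1} x).
\end{align*}
% Case |x| \le p/2: \rho(p^{-1}|x|) = p^{-1}|x|, so p\psi(p^{-1}x) = x and the prefactor equals 1.
% Case |x| > p/2: \rho(p^{-1}|x|) \ge 1/2, so the prefactor is at most 2|x|/p, and
\[
\frac{|x|}{p \, \rho(p^{-1}|x|)} \bigl( 1 + |p \, \psi(p^{-1} x)| \bigr)
  \le \frac{2|x|}{p} \, (1 + p) \le 4|x| \qquad (p \ge 1).
\]
```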